Overview
Dataset statistics
| Number of variables | 172 |
|---|---|
| Number of observations | 2361473 |
| Missing cells | 246278207 |
| Missing cells (%) | 60.6% |
| Total size in memory | 3.0 GiB |
| Average record size in memory | 1.3 KiB |
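The percentage and per-record figures in this table can be rederived from the raw counts. A minimal sketch, assuming GiB and KiB are binary units (2^30 and 2^10 bytes); the 3.0 GiB total is already rounded in the report, so the per-record size is only approximate:

```python
# Raw figures copied from the Dataset statistics table.
n_variables = 172
n_observations = 2_361_473
missing_cells = 246_278_207
total_size_gib = 3.0  # rounded in the report, so the result below is approximate

total_cells = n_variables * n_observations
missing_pct = 100 * missing_cells / total_cells

# Assumption: GiB and KiB are binary units (2**30 and 2**10 bytes).
avg_record_kib = total_size_gib * 2**30 / n_observations / 2**10

print(f"total cells: {total_cells}")
print(f"missing cells: {missing_pct:.1f}%")
print(f"average record size: {avg_record_kib:.1f} KiB")
```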
Variable types
| Text | 172 |
|---|
Dataset
| Description | NMNH Extant Specimen Records (USNM, US) 0049395-241126133413365 |
|---|---|
| URL | https://doi.org/10.15468/dl.42mnjx |
Alerts
license has constant value "CC0_1_0" | Constant |
publisher has constant value "National Museum of Natural History, Smithsonian Institution" | Constant |
datasetName has constant value "NMNH Extant Biology" | Constant |
eventType has constant value "Baffin Island" | Constant |
samplingEffort has constant value "67.0" | Constant |
fieldNotes has constant value "-63.0" | Constant |
municipality has constant value "-53.33" | Constant |
coordinatePrecision has constant value "Leeward Is." | Constant |
geologicalContextID has constant value "3" | Constant |
earliestEonOrLowestEonothem has constant value "29" | Constant |
earliestEraOrLowestErathem has constant value "Plantae" | Constant |
latestEraOrHighestErathem has constant value "Tracheophyta" | Constant |
earliestPeriodOrLowestSystem has constant value "Magnoliopsida" | Constant |
earliestEpochOrLowestSeries has constant value "5410907" | Constant |
latestAgeOrHighestStage has constant value "North Atlantic Ocean" | Constant |
lowestBiostratigraphicZone has constant value "Scharf, U." | Constant |
identificationID has constant value "Baja California Norte" | Constant |
dateIdentified has constant value "Asterales" | Constant |
identificationReferences has constant value "Guatteria punctata (Aubl.) R.A.Howard" | Constant |
scientificNameID has constant value "69.0" | Constant |
parentNameUsageID has constant value "Campanula" | Constant |
originalNameUsageID has constant value "Plantae, Dicotyledonae (basal), Magnoliales, Annonaceae, Annonoideae" | Constant |
nameAccordingToID has constant value "Plantae" | Constant |
nameAccordingTo has constant value "6" | Constant |
relativeOrganismQuantity has constant value "3034046" | Constant |
catalogNumber has 213212 (9.0%) missing values | Missing |
recordNumber has 1045439 (44.3%) missing values | Missing |
recordedBy has 498671 (21.1%) missing values | Missing |
sex has 2009611 (85.1%) missing values | Missing |
lifeStage has 2107148 (89.2%) missing values | Missing |
preparations has 1223408 (51.8%) missing values | Missing |
associatedSequences has 2358372 (99.9%) missing values | Missing |
occurrenceRemarks has 2047572 (86.7%) missing values | Missing |
verbatimLabel has 2361471 (> 99.9%) missing values | Missing |
materialSampleID has 2361471 (> 99.9%) missing values | Missing |
eventType has 2361472 (> 99.9%) missing values | Missing |
fieldNumber has 2164715 (91.7%) missing values | Missing |
eventDate has 419648 (17.8%) missing values | Missing |
startDayOfYear has 669491 (28.4%) missing values | Missing |
endDayOfYear has 669490 (28.4%) missing values | Missing |
year has 423106 (17.9%) missing values | Missing |
month has 542654 (23.0%) missing values | Missing |
day has 762160 (32.3%) missing values | Missing |
verbatimEventDate has 1255739 (53.2%) missing values | Missing |
habitat has 2177646 (92.2%) missing values | Missing |
samplingEffort has 2361472 (> 99.9%) missing values | Missing |
fieldNotes has 2361472 (> 99.9%) missing values | Missing |
locationID has 2084512 (88.3%) missing values | Missing |
higherGeography has 73521 (3.1%) missing values | Missing |
continent has 411637 (17.4%) missing values | Missing |
waterBody has 1923759 (81.5%) missing values | Missing |
islandGroup has 2309219 (97.8%) missing values | Missing |
island has 2204401 (93.3%) missing values | Missing |
countryCode has 95309 (4.0%) missing values | Missing |
stateProvince has 637065 (27.0%) missing values | Missing |
county has 1825433 (77.3%) missing values | Missing |
municipality has 2361472 (> 99.9%) missing values | Missing |
locality has 337166 (14.3%) missing values | Missing |
verbatimElevation has 2293088 (97.1%) missing values | Missing |
verbatimDepth has 2347005 (99.4%) missing values | Missing |
decimalLatitude has 1649765 (69.9%) missing values | Missing |
decimalLongitude has 1649765 (69.9%) missing values | Missing |
coordinateUncertaintyInMeters has 2318351 (98.2%) missing values | Missing |
coordinatePrecision has 2361472 (> 99.9%) missing values | Missing |
pointRadiusSpatialFit has 2361470 (> 99.9%) missing values | Missing |
verbatimCoordinateSystem has 2103318 (89.1%) missing values | Missing |
verbatimSRS has 2361467 (> 99.9%) missing values | Missing |
footprintSRS has 2361470 (> 99.9%) missing values | Missing |
footprintSpatialFit has 2361469 (> 99.9%) missing values | Missing |
georeferencedBy has 2361464 (> 99.9%) missing values | Missing |
georeferencedDate has 2361470 (> 99.9%) missing values | Missing |
georeferenceProtocol has 2055868 (87.1%) missing values | Missing |
georeferenceSources has 2361471 (> 99.9%) missing values | Missing |
georeferenceRemarks has 2309427 (97.8%) missing values | Missing |
geologicalContextID has 2361472 (> 99.9%) missing values | Missing |
earliestEonOrLowestEonothem has 2361472 (> 99.9%) missing values | Missing |
latestEonOrHighestEonothem has 2361470 (> 99.9%) missing values | Missing |
earliestEraOrLowestErathem has 2361471 (> 99.9%) missing values | Missing |
latestEraOrHighestErathem has 2361471 (> 99.9%) missing values | Missing |
earliestPeriodOrLowestSystem has 2361471 (> 99.9%) missing values | Missing |
latestPeriodOrHighestSystem has 2361471 (> 99.9%) missing values | Missing |
earliestEpochOrLowestSeries has 2361472 (> 99.9%) missing values | Missing |
latestEpochOrHighestSeries has 2361465 (> 99.9%) missing values | Missing |
earliestAgeOrLowestStage has 2361468 (> 99.9%) missing values | Missing |
latestAgeOrHighestStage has 2361472 (> 99.9%) missing values | Missing |
lowestBiostratigraphicZone has 2361472 (> 99.9%) missing values | Missing |
highestBiostratigraphicZone has 2361470 (> 99.9%) missing values | Missing |
lithostratigraphicTerms has 2361463 (> 99.9%) missing values | Missing |
group has 2361468 (> 99.9%) missing values | Missing |
formation has 2361470 (> 99.9%) missing values | Missing |
member has 2361471 (> 99.9%) missing values | Missing |
bed has 2361466 (> 99.9%) missing values | Missing |
identificationID has 2361472 (> 99.9%) missing values | Missing |
verbatimIdentification has 2361470 (> 99.9%) missing values | Missing |
identificationQualifier has 2352474 (99.6%) missing values | Missing |
typeStatus has 2274525 (96.3%) missing values | Missing |
identifiedBy has 1955406 (82.8%) missing values | Missing |
identifiedByID has 2361470 (> 99.9%) missing values | Missing |
dateIdentified has 2361472 (> 99.9%) missing values | Missing |
identificationReferences has 2361472 (> 99.9%) missing values | Missing |
identificationVerificationStatus has 2361466 (> 99.9%) missing values | Missing |
identificationRemarks has 2361467 (> 99.9%) missing values | Missing |
taxonID has 2361471 (> 99.9%) missing values | Missing |
scientificNameID has 2361472 (> 99.9%) missing values | Missing |
parentNameUsageID has 2361472 (> 99.9%) missing values | Missing |
originalNameUsageID has 2361472 (> 99.9%) missing values | Missing |
nameAccordingToID has 2361472 (> 99.9%) missing values | Missing |
namePublishedInID has 2361469 (> 99.9%) missing values | Missing |
taxonConceptID has 2361471 (> 99.9%) missing values | Missing |
acceptedNameUsage has 2361470 (> 99.9%) missing values | Missing |
parentNameUsage has 2361469 (> 99.9%) missing values | Missing |
originalNameUsage has 2361471 (> 99.9%) missing values | Missing |
nameAccordingTo has 2361471 (> 99.9%) missing values | Missing |
namePublishedIn has 2361470 (> 99.9%) missing values | Missing |
namePublishedInYear has 2361470 (> 99.9%) missing values | Missing |
class has 138563 (5.9%) missing values | Missing |
order has 145729 (6.2%) missing values | Missing |
superfamily has 2361471 (> 99.9%) missing values | Missing |
family has 52497 (2.2%) missing values | Missing |
subfamily has 2361471 (> 99.9%) missing values | Missing |
subtribe has 2361470 (> 99.9%) missing values | Missing |
genus has 120652 (5.1%) missing values | Missing |
genericName has 120743 (5.1%) missing values | Missing |
subgenus has 2361470 (> 99.9%) missing values | Missing |
infragenericEpithet has 2361471 (> 99.9%) missing values | Missing |
specificEpithet has 306545 (13.0%) missing values | Missing |
infraspecificEpithet has 2138642 (90.6%) missing values | Missing |
cultivarEpithet has 2361470 (> 99.9%) missing values | Missing |
verbatimTaxonRank has 2361470 (> 99.9%) missing values | Missing |
vernacularName has 2361469 (> 99.9%) missing values | Missing |
nomenclaturalCode has 2361468 (> 99.9%) missing values | Missing |
nomenclaturalStatus has 2361469 (> 99.9%) missing values | Missing |
taxonRemarks has 2361470 (> 99.9%) missing values | Missing |
elevation has 1813940 (76.8%) missing values | Missing |
elevationAccuracy has 2160162 (91.5%) missing values | Missing |
depth has 2098489 (88.9%) missing values | Missing |
depthAccuracy has 2120420 (89.8%) missing values | Missing |
distanceFromCentroidInMeters has 2356831 (99.8%) missing values | Missing |
mediaType has 863248 (36.6%) missing values | Missing |
classKey has 138564 (5.9%) missing values | Missing |
orderKey has 145723 (6.2%) missing values | Missing |
familyKey has 52492 (2.2%) missing values | Missing |
genusKey has 120649 (5.1%) missing values | Missing |
subgenusKey has 2361466 (> 99.9%) missing values | Missing |
speciesKey has 306496 (13.0%) missing values | Missing |
species has 306502 (13.0%) missing values | Missing |
verbatimScientificName has 94306 (4.0%) missing values | Missing |
typifiedName has 2361471 (> 99.9%) missing values | Missing |
repatriated has 92313 (3.9%) missing values | Missing |
relativeOrganismQuantity has 2361472 (> 99.9%) missing values | Missing |
projectId has 2361467 (> 99.9%) missing values | Missing |
gbifRegion has 114374 (4.8%) missing values | Missing |
level0Gid has 1911133 (80.9%) missing values | Missing |
level0Name has 1911134 (80.9%) missing values | Missing |
level1Gid has 1912772 (81.0%) missing values | Missing |
level1Name has 1912766 (81.0%) missing values | Missing |
level2Gid has 1927752 (81.6%) missing values | Missing |
level2Name has 1927850 (81.6%) missing values | Missing |
level3Gid has 2259567 (95.7%) missing values | Missing |
level3Name has 2260777 (95.7%) missing values | Missing |
iucnRedListCategory has 383090 (16.2%) missing values | Missing |
gbifID has unique values | Unique |
occurrenceID has unique values | Unique |
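Several columns carry both a Constant and a Missing alert (for example eventType, with 2361472 of 2361473 values missing), which is consistent with a single populated row. The sketch below shows one way such labels could be derived from per-column counts. The helper `classify` and its rules are hypothetical and are not taken from ydata-profiling's implementation; the distinct count of 1 for eventType is inferred from its Constant alert.

```python
def classify(n_rows, n_missing, n_distinct):
    """Return alert labels for one column (hypothetical rules, not the library's code)."""
    alerts = []
    n_present = n_rows - n_missing
    if n_present > 0 and n_distinct == 1:
        alerts.append("Constant")  # every non-missing value is identical
    if n_missing > 0:
        alerts.append("Missing")
    if n_missing == 0 and n_distinct == n_rows:
        alerts.append("Unique")    # no value repeats
    return alerts


N = 2_361_473  # number of observations in the report

print("license:", classify(N, 0, 1))
print("eventType:", classify(N, 2_361_472, 1))  # one populated row
print("gbifID:", classify(N, 0, N))
```

Under these assumed rules, a column with a single non-missing value is flagged as Constant even though it is almost entirely empty, so the two alerts are best read together.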
Reproduction
| Analysis started | 2025-01-08 22:43:58.844868 |
|---|---|
| Analysis finished | 2025-01-08 22:46:14.905392 |
| Duration | 2 minutes and 16.06 seconds |
| Software version | ydata-profiling v4.12.1 |
| Download configuration | config.json |
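The reported duration can be checked against the two timestamps. A small sketch using only the values shown in the table:

```python
from datetime import datetime

FMT = "%Y-%m-%d %H:%M:%S.%f"
started = datetime.strptime("2025-01-08 22:43:58.844868", FMT)
finished = datetime.strptime("2025-01-08 22:46:14.905392", FMT)

# Elapsed wall-clock time in seconds, then split into minutes and seconds.
elapsed = (finished - started).total_seconds()
minutes, seconds = divmod(elapsed, 60)

print(f"{int(minutes)} minutes and {seconds:.2f} seconds")
```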
Variables
gbifID
Text
Unique 
| Distinct | 2361473 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 18.0 MiB |
Length
| Max length | 10 |
|---|---|
| Median length | 10 |
| Mean length | 10 |
| Min length | 10 |
Unique
| Unique | 2361473 |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | 1321585620 |
|---|---|
| 2nd row | 2452323322 |
| 3rd row | 1321585780 |
| 4th row | 1320143695 |
| 5th row | 2397792128 |
| Value | Count | Frequency (%) |
| 1321585620 | 1 | < 0.1% |
| 1320155873 | 1 | < 0.1% |
| 2549497867 | 1 | < 0.1% |
| 1320145763 | 1 | < 0.1% |
| 1321586167 | 1 | < 0.1% |
| 1321585780 | 1 | < 0.1% |
| 1320143695 | 1 | < 0.1% |
| 2397792128 | 1 | < 0.1% |
| 1320143630 | 1 | < 0.1% |
| 1321585990 | 1 | < 0.1% |
| Other values (2361463) | 2361463 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 4191710 | |
| 3 | 3357958 | |
| 2 | 3144111 | |
| 5 | 1911139 | |
| 8 | 1885660 | |
| 7 | 1878917 | |
| 0 | 1861726 | |
| 4 | 1811401 | |
| 9 | 1790742 | |
| 6 | 1781366 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 23614730 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 4191710 | |
| 3 | 3357958 | |
| 2 | 3144111 | |
| 5 | 1911139 | |
| 8 | 1885660 | |
| 7 | 1878917 | |
| 0 | 1861726 | |
| 4 | 1811401 | |
| 9 | 1790742 | |
| 6 | 1781366 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 23614730 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 4191710 | |
| 3 | 3357958 | |
| 2 | 3144111 | |
| 5 | 1911139 | |
| 8 | 1885660 | |
| 7 | 1878917 | |
| 0 | 1861726 | |
| 4 | 1811401 | |
| 9 | 1790742 | |
| 6 | 1781366 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 23614730 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 4191710 | |
| 3 | 3357958 | |
| 2 | 3144111 | |
| 5 | 1911139 | |
| 8 | 1885660 | |
| 7 | 1878917 | |
| 0 | 1861726 | |
| 4 | 1811401 | |
| 9 | 1790742 | |
| 6 | 1781366 |
license
Text
Constant 
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 18.0 MiB |
Length
| Max length | 7 |
|---|---|
| Median length | 7 |
| Mean length | 7 |
| Min length | 7 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | CC0_1_0 |
|---|---|
| 2nd row | CC0_1_0 |
| 3rd row | CC0_1_0 |
| 4th row | CC0_1_0 |
| 5th row | CC0_1_0 |
| Value | Count | Frequency (%) |
| cc0_1_0 | 2361473 |
Most occurring characters
| Value | Count | Frequency (%) |
| C | 4722946 | |
| 0 | 4722946 | |
| _ | 4722946 | |
| 1 | 2361473 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 7084419 | |
| Uppercase Letter | 4722946 | |
| Connector Punctuation | 4722946 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 4722946 | |
| 1 | 2361473 |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 4722946 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 4722946 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 11807365 | |
| Latin | 4722946 | 28.6% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 4722946 | |
| _ | 4722946 | |
| 1 | 2361473 |
Latin
| Value | Count | Frequency (%) |
| C | 4722946 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 16530311 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| C | 4722946 | |
| 0 | 4722946 | |
| _ | 4722946 | |
| 1 | 2361473 |
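The character and category totals for a constant column follow directly from the single value and the row count. A minimal sketch, assuming the category names in the report correspond to Unicode general categories as exposed by Python's `unicodedata` module (the mapping `CATEGORY_NAMES` is written out by hand here and covers only the codes that occur in this value):

```python
import unicodedata
from collections import Counter

# Long names for the general-category codes that occur in "CC0_1_0".
CATEGORY_NAMES = {
    "Nd": "Decimal Number",
    "Lu": "Uppercase Letter",
    "Pc": "Connector Punctuation",
}

value = "CC0_1_0"
n_rows = 2_361_473  # every row holds the same value

chars = Counter(value)
categories = Counter(CATEGORY_NAMES[unicodedata.category(ch)] for ch in value)

# Scale the per-value counts by the number of rows to get column totals.
char_totals = {ch: n * n_rows for ch, n in chars.items()}
category_totals = {name: n * n_rows for name, n in categories.items()}
total = len(value) * n_rows

for name, count in category_totals.items():
    print(f"{name}: {count} ({100 * count / total:.1f}%)")
```

The same ratio of count to total gives the Frequency (%) value for rows where that cell is empty in the tables above.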
modified
Text
| Distinct | 231346 |
|---|---|
| Distinct (%) | 9.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 18.0 MiB |
Length
| Max length | 20 |
|---|---|
| Median length | 20 |
| Mean length | 20 |
| Min length | 20 |
Unique
| Unique | 103346 |
|---|---|
| Unique (%) | 4.4% |
Sample
| 1st row | 2023-05-10T09:22:00Z |
|---|---|
| 2nd row | 2022-01-03T14:31:00Z |
| 3rd row | 2022-08-17T11:23:00Z |
| 4th row | 2022-12-30T12:34:00Z |
| 5th row | 2019-07-10T10:37:00Z |
| Value | Count | Frequency (%) |
| 2017-04-17t11:48:00z | 2463 | 0.1% |
| 2017-04-17t11:49:00z | 2417 | 0.1% |
| 2024-09-25t13:44:00z | 2393 | 0.1% |
| 2024-09-25t13:46:00z | 2237 | 0.1% |
| 2017-04-17t11:50:00z | 2230 | 0.1% |
| 2024-09-25t17:07:00z | 2222 | 0.1% |
| 2024-09-25t17:02:00z | 2213 | 0.1% |
| 2017-04-17t11:47:00z | 2206 | 0.1% |
| 2024-09-25t13:45:00z | 2193 | 0.1% |
| 2024-09-25t17:05:00z | 2193 | 0.1% |
| Other values (231336) | 2338706 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 11717324 | |
| 2 | 6603970 | |
| 1 | 5691162 | |
| - | 4722946 | |
| : | 4722946 | |
| T | 2361473 | 5.0% |
| Z | 2361473 | 5.0% |
| 4 | 1546513 | 3.3% |
| 3 | 1539433 | 3.3% |
| 5 | 1442686 | 3.1% |
| Other values (4) | 4519534 | 9.6% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 33060622 | |
| Dash Punctuation | 4722946 | 10.0% |
| Other Punctuation | 4722946 | 10.0% |
| Uppercase Letter | 4722946 | 10.0% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 11717324 | |
| 2 | 6603970 | |
| 1 | 5691162 | |
| 4 | 1546513 | 4.7% |
| 3 | 1539433 | 4.7% |
| 5 | 1442686 | 4.4% |
| 9 | 1388747 | 4.2% |
| 8 | 1153529 | 3.5% |
| 7 | 1074741 | 3.3% |
| 6 | 902517 | 2.7% |
Uppercase Letter
| Value | Count | Frequency (%) |
| T | 2361473 | |
| Z | 2361473 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 4722946 |
Other Punctuation
| Value | Count | Frequency (%) |
| : | 4722946 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 42506514 | |
| Latin | 4722946 | 10.0% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 11717324 | |
| 2 | 6603970 | |
| 1 | 5691162 | |
| - | 4722946 | |
| : | 4722946 | |
| 4 | 1546513 | 3.6% |
| 3 | 1539433 | 3.6% |
| 5 | 1442686 | 3.4% |
| 9 | 1388747 | 3.3% |
| 8 | 1153529 | 2.7% |
| Other values (2) | 1977258 | 4.7% |
Latin
| Value | Count | Frequency (%) |
| T | 2361473 | |
| Z | 2361473 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 47229460 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 11717324 | |
| 2 | 6603970 | |
| 1 | 5691162 | |
| - | 4722946 | |
| : | 4722946 | |
| T | 2361473 | 5.0% |
| Z | 2361473 | 5.0% |
| 4 | 1546513 | 3.3% |
| 3 | 1539433 | 3.3% |
| 5 | 1442686 | 3.1% |
| Other values (4) | 4519534 | 9.6% |
publisher
Text
Constant 
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 18.0 MiB |
Length
| Max length | 59 |
|---|---|
| Median length | 59 |
| Mean length | 59 |
| Min length | 59 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | National Museum of Natural History, Smithsonian Institution |
|---|---|
| 2nd row | National Museum of Natural History, Smithsonian Institution |
| 3rd row | National Museum of Natural History, Smithsonian Institution |
| 4th row | National Museum of Natural History, Smithsonian Institution |
| 5th row | National Museum of Natural History, Smithsonian Institution |
| Value | Count | Frequency (%) |
| national | 2361473 | |
| museum | 2361473 | |
| of | 2361473 | |
| natural | 2361473 | |
| history | 2361473 | |
| smithsonian | 2361473 | |
| institution | 2361473 |
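The value tables for Text variables list lowercased words rather than the full strings shown under Sample. The sketch below reproduces that pattern under assumed rules (lowercase, split on whitespace, strip punctuation at the ends of each word). These rules are inferred from the tables in this report, not from the library's documented tokenizer, and the helper `tokens` is hypothetical.

```python
import string
from collections import Counter


def tokens(text):
    """Lowercase, split on whitespace, strip surrounding punctuation (assumed rules)."""
    words = (w.strip(string.punctuation) for w in text.lower().split())
    return [w for w in words if w]


publisher = "National Museum of Natural History, Smithsonian Institution"
counts = Counter(tokens(publisher))
print(counts)

# Values without whitespace stay whole, as in the institutionID and modified tables.
print(tokens("urn:lsid:biocol.org:col:34871"))
print(tokens("2017-04-17T11:48:00Z"))
```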
Most occurring characters
| Value | Count | Frequency (%) |
| t | 16530311 | |
| i | 14168838 | |
| (space) | 14168838 | |
| a | 11807365 | 8.5% |
| o | 11807365 | 8.5% |
| n | 11807365 | 8.5% |
| s | 9445892 | 6.8% |
| u | 9445892 | 6.8% |
| r | 4722946 | 3.4% |
| m | 4722946 | 3.4% |
| Other values (11) | 30699149 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 108627758 | |
| Space Separator | 14168838 | 10.2% |
| Uppercase Letter | 14168838 | 10.2% |
| Other Punctuation | 2361473 | 1.7% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| t | 16530311 | |
| i | 14168838 | |
| a | 11807365 | |
| o | 11807365 | |
| n | 11807365 | |
| s | 9445892 | |
| u | 9445892 | |
| r | 4722946 | 4.3% |
| m | 4722946 | 4.3% |
| l | 4722946 | 4.3% |
| Other values (4) | 9445892 |
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 4722946 | |
| M | 2361473 | |
| H | 2361473 | |
| S | 2361473 | |
| I | 2361473 |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 14168838 |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 2361473 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 122796596 | |
| Common | 16530311 | 11.9% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| t | 16530311 | |
| i | 14168838 | |
| a | 11807365 | |
| o | 11807365 | |
| n | 11807365 | |
| s | 9445892 | 7.7% |
| u | 9445892 | 7.7% |
| r | 4722946 | 3.8% |
| m | 4722946 | 3.8% |
| N | 4722946 | 3.8% |
| Other values (9) | 23614730 |
Common
| Value | Count | Frequency (%) |
| (space) | 14168838 | |
| , | 2361473 | 14.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 139326907 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| t | 16530311 | |
| i | 14168838 | |
| (space) | 14168838 | |
| a | 11807365 | 8.5% |
| o | 11807365 | 8.5% |
| n | 11807365 | 8.5% |
| s | 9445892 | 6.8% |
| u | 9445892 | 6.8% |
| r | 4722946 | 3.4% |
| m | 4722946 | 3.4% |
| Other values (11) | 30699149 |
institutionID
Text
| Distinct | 37 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 18.0 MiB |
Length
| Max length | 29 |
|---|---|
| Median length | 29 |
| Mean length | 28.98757852 |
| Min length | 2 |
Unique
| Unique | 8 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | urn:lsid:biocol.org:col:34871 |
|---|---|
| 2nd row | urn:lsid:biocol.org:col:15463 |
| 3rd row | urn:lsid:biocol.org:col:34871 |
| 4th row | urn:lsid:biocol.org:col:34871 |
| 5th row | urn:lsid:biocol.org:col:34871 |
| Value | Count | Frequency (%) |
| urn:lsid:biocol.org:col:34871 | 1210999 | |
| urn:lsid:biocol.org:col:15463 | 1149318 | |
| nsmt | 255 | < 0.1% |
| uam | 205 | < 0.1% |
| rmnh | 94 | < 0.1% |
| nrm | 92 | < 0.1% |
| nmv | 65 | < 0.1% |
| rcs | 61 | < 0.1% |
| zmmu | 46 | < 0.1% |
| nmsz | 44 | < 0.1% |
| Other values (27) | 294 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| o | 9441268 | |
| : | 9441268 | |
| l | 7080951 | 10.3% |
| c | 4720634 | 6.9% |
| i | 4720634 | 6.9% |
| r | 4720634 | 6.9% |
| s | 2360317 | 3.4% |
| d | 2360317 | 3.4% |
| b | 2360317 | 3.4% |
| n | 2360317 | 3.4% |
| Other values (30) | 18886727 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 44846023 | |
| Other Punctuation | 11801585 | 17.2% |
| Decimal Number | 11801585 | 17.2% |
| Uppercase Letter | 4191 | < 0.1% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| M | 1128 | |
| N | 681 | |
| S | 459 | |
| A | 347 | 8.3% |
| U | 306 | 7.3% |
| T | 255 | 6.1% |
| R | 251 | 6.0% |
| H | 151 | 3.6% |
| C | 125 | 3.0% |
| Z | 116 | 2.8% |
| Other values (10) | 372 | 8.9% |
Lowercase Letter
| Value | Count | Frequency (%) |
| o | 9441268 | |
| l | 7080951 | |
| c | 4720634 | |
| i | 4720634 | |
| r | 4720634 | |
| s | 2360317 | 5.3% |
| d | 2360317 | 5.3% |
| b | 2360317 | 5.3% |
| n | 2360317 | 5.3% |
| g | 2360317 | 5.3% |
Decimal Number
| Value | Count | Frequency (%) |
| 3 | 2360317 | |
| 4 | 2360317 | |
| 1 | 2360317 | |
| 8 | 1210999 | |
| 7 | 1210999 | |
| 5 | 1149318 | |
| 6 | 1149318 |
Other Punctuation
| Value | Count | Frequency (%) |
| : | 9441268 | |
| . | 2360317 | 20.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 44850214 | |
| Common | 23603170 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| o | 9441268 | |
| l | 7080951 | |
| c | 4720634 | |
| i | 4720634 | |
| r | 4720634 | |
| s | 2360317 | 5.3% |
| d | 2360317 | 5.3% |
| b | 2360317 | 5.3% |
| n | 2360317 | 5.3% |
| g | 2360317 | 5.3% |
| Other values (21) | 2364508 | 5.3% |
Common
| Value | Count | Frequency (%) |
| : | 9441268 | |
| . | 2360317 | 10.0% |
| 3 | 2360317 | 10.0% |
| 4 | 2360317 | 10.0% |
| 1 | 2360317 | 10.0% |
| 8 | 1210999 | 5.1% |
| 7 | 1210999 | 5.1% |
| 5 | 1149318 | 4.9% |
| 6 | 1149318 | 4.9% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 68453384 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| o | 9441268 | |
| : | 9441268 | |
| l | 7080951 | 10.3% |
| c | 4720634 | 6.9% |
| i | 4720634 | 6.9% |
| r | 4720634 | 6.9% |
| s | 2360317 | 3.4% |
| d | 2360317 | 3.4% |
| b | 2360317 | 3.4% |
| n | 2360317 | 3.4% |
| Other values (30) | 18886727 |
collectionID
Text
| Distinct | 7 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 18.0 MiB |
Length
| Max length | 45 |
|---|---|
| Median length | 45 |
| Mean length | 45 |
| Min length | 45 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | urn:uuid:f14c21a9-8cbf-4c8b-817f-d19d427e2dd6 |
|---|---|
| 2nd row | urn:uuid:60e28f81-e634-4869-aa3e-732caed713c8 |
| 3rd row | urn:uuid:cc104cbf-fd8e-4801-9b71-36731a7db1a0 |
| 4th row | urn:uuid:f14c21a9-8cbf-4c8b-817f-d19d427e2dd6 |
| 5th row | urn:uuid:f14c21a9-8cbf-4c8b-817f-d19d427e2dd6 |
| Value | Count | Frequency (%) |
| urn:uuid:60e28f81-e634-4869-aa3e-732caed713c8 | 1149318 | |
| urn:uuid:f14c21a9-8cbf-4c8b-817f-d19d427e2dd6 | 490281 | |
| urn:uuid:18e3cd08-a962-4f0a-b72c-9a0b3600c5ad | 154108 | 6.5% |
| urn:uuid:59e56a59-8615-4e0c-841d-eb88f3876b22 | 152955 | 6.5% |
| urn:uuid:73d83e23-1999-42cd-b38a-c06a7d32d893 | 149231 | 6.3% |
| urn:uuid:cc104cbf-fd8e-4801-9b71-36731a7db1a0 | 148897 | 6.3% |
| urn:uuid:09c9cf5f-f5d3-48cc-b5c8-cd9b9fbd631f | 116683 | 4.9% |
Most occurring characters
| Value | Count | Frequency (%) |
| - | 9445892 | 8.9% |
| 8 | 8119959 | 7.6% |
| d | 7177853 | 6.8% |
| u | 7084419 | 6.7% |
| 3 | 6484989 | 6.1% |
| e | 5998654 | 5.6% |
| c | 5830009 | 5.5% |
| 1 | 5730177 | 5.4% |
| a | 5303878 | 5.0% |
| 6 | 5120127 | 4.8% |
| Other values (12) | 39970328 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 47269123 | |
| Lowercase Letter | 44828324 | |
| Dash Punctuation | 9445892 | 8.9% |
| Other Punctuation | 4722946 | 4.4% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 8 | 8119959 | |
| 3 | 6484989 | |
| 1 | 5730177 | |
| 6 | 5120127 | |
| 2 | 4831298 | |
| 4 | 4793205 | |
| 7 | 4331414 | |
| 9 | 3956559 | |
| 0 | 2785418 | 5.9% |
| 5 | 1115977 | 2.4% |
Lowercase Letter
| Value | Count | Frequency (%) |
| d | 7177853 | |
| u | 7084419 | |
| e | 5998654 | |
| c | 5830009 | |
| a | 5303878 | |
| f | 3808433 | |
| b | 2540659 | 5.7% |
| r | 2361473 | 5.3% |
| i | 2361473 | 5.3% |
| n | 2361473 | 5.3% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 9445892 |
Other Punctuation
| Value | Count | Frequency (%) |
| : | 4722946 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 61437961 | |
| Latin | 44828324 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| - | 9445892 | |
| 8 | 8119959 | |
| 3 | 6484989 | |
| 1 | 5730177 | |
| 6 | 5120127 | |
| 2 | 4831298 | |
| 4 | 4793205 | |
| : | 4722946 | |
| 7 | 4331414 | |
| 9 | 3956559 | |
| Other values (2) | 3901395 |
Latin
| Value | Count | Frequency (%) |
| d | 7177853 | |
| u | 7084419 | |
| e | 5998654 | |
| c | 5830009 | |
| a | 5303878 | |
| f | 3808433 | |
| b | 2540659 | 5.7% |
| r | 2361473 | 5.3% |
| i | 2361473 | 5.3% |
| n | 2361473 | 5.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 106266285 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| - | 9445892 | 8.9% |
| 8 | 8119959 | 7.6% |
| d | 7177853 | 6.8% |
| u | 7084419 | 6.7% |
| 3 | 6484989 | 6.1% |
| e | 5998654 | 5.6% |
| c | 5830009 | 5.5% |
| 1 | 5730177 | 5.4% |
| a | 5303878 | 5.0% |
| 6 | 5120127 | 4.8% |
| Other values (12) | 39970328 |
institutionCode
Text
| Distinct | 37 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 18.0 MiB |
Length
| Max length | 6 |
|---|---|
| Median length | 4 |
| Mean length | 3.026425879 |
| Min length | 2 |
Unique
| Unique | 8 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | USNM |
|---|---|
| 2nd row | US |
| 3rd row | USNM |
| 4th row | USNM |
| 5th row | USNM |
| Value | Count | Frequency (%) |
| usnm | 1210999 | |
| us | 1149318 | |
| nsmt | 255 | < 0.1% |
| uam | 205 | < 0.1% |
| rmnh | 94 | < 0.1% |
| nrm | 92 | < 0.1% |
| nmv | 65 | < 0.1% |
| rcs | 61 | < 0.1% |
| zmmu | 46 | < 0.1% |
| nmsz | 44 | < 0.1% |
| Other values (27) | 294 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| S | 2360776 | |
| U | 2360623 | |
| M | 1212127 | |
| N | 1211680 | |
| A | 347 | < 0.1% |
| T | 255 | < 0.1% |
| R | 251 | < 0.1% |
| H | 151 | < 0.1% |
| C | 125 | < 0.1% |
| Z | 116 | < 0.1% |
| Other values (10) | 372 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 7146823 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 2360776 | |
| U | 2360623 | |
| M | 1212127 | |
| N | 1211680 | |
| A | 347 | < 0.1% |
| T | 255 | < 0.1% |
| R | 251 | < 0.1% |
| H | 151 | < 0.1% |
| C | 125 | < 0.1% |
| Z | 116 | < 0.1% |
| Other values (10) | 372 | < 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 7146823 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| S | 2360776 | |
| U | 2360623 | |
| M | 1212127 | |
| N | 1211680 | |
| A | 347 | < 0.1% |
| T | 255 | < 0.1% |
| R | 251 | < 0.1% |
| H | 151 | < 0.1% |
| C | 125 | < 0.1% |
| Z | 116 | < 0.1% |
| Other values (10) | 372 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 7146823 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| S | 2360776 | |
| U | 2360623 | |
| M | 1212127 | |
| N | 1211680 | |
| A | 347 | < 0.1% |
| T | 255 | < 0.1% |
| R | 251 | < 0.1% |
| H | 151 | < 0.1% |
| C | 125 | < 0.1% |
| Z | 116 | < 0.1% |
| Other values (10) | 372 | < 0.1% |
collectionCode
Text
| Distinct | 7 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 18.0 MiB |
Length
| Max length | 5 |
|---|---|
| Median length | 2 |
| Mean length | 2.609310799 |
| Min length | 2 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | IZ |
|---|---|
| 2nd row | US |
| 3rd row | HERP |
| 4th row | IZ |
| 5th row | IZ |
| Value | Count | Frequency (%) |
| us | 1149318 | |
| iz | 490281 | |
| ent | 154108 | 6.5% |
| mamm | 152955 | 6.5% |
| birds | 149231 | 6.3% |
| herp | 148897 | 6.3% |
| fish | 116683 | 4.9% |
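The Length statistics for collectionCode can be recomputed from this value table, since it lists every distinct value. A short sketch using only the counts shown above:

```python
# Value counts copied from the collectionCode table (length is unaffected by case).
value_counts = {
    "us": 1_149_318,
    "iz": 490_281,
    "ent": 154_108,
    "mamm": 152_955,
    "birds": 149_231,
    "herp": 148_897,
    "fish": 116_683,
}

n_rows = sum(value_counts.values())
total_chars = sum(len(value) * count for value, count in value_counts.items())
mean_length = total_chars / n_rows

# Median: walk the lengths in ascending order until half the rows are covered.
covered = 0
median_length = None
for length, count in sorted((len(v), c) for v, c in value_counts.items()):
    covered += count
    if covered * 2 >= n_rows:
        median_length = length
        break

print(f"rows: {n_rows}, characters: {total_chars}")
print(f"mean length: {mean_length:.9f}, median length: {median_length}")
```

The character total also matches the ASCII block count reported for this variable, which is a quick consistency check between the value table and the character tables.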
Most occurring characters
| Value | Count | Frequency (%) |
| S | 1415232 | |
| U | 1149318 | |
| I | 756195 | |
| Z | 490281 | 8.0% |
| M | 458865 | 7.4% |
| E | 303005 | 4.9% |
| R | 298128 | 4.8% |
| H | 265580 | 4.3% |
| N | 154108 | 2.5% |
| T | 154108 | 2.5% |
| Other values (5) | 716997 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 6161817 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 1415232 | |
| U | 1149318 | |
| I | 756195 | |
| Z | 490281 | 8.0% |
| M | 458865 | 7.4% |
| E | 303005 | 4.9% |
| R | 298128 | 4.8% |
| H | 265580 | 4.3% |
| N | 154108 | 2.5% |
| T | 154108 | 2.5% |
| Other values (5) | 716997 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 6161817 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| S | 1415232 | |
| U | 1149318 | |
| I | 756195 | |
| Z | 490281 | 8.0% |
| M | 458865 | 7.4% |
| E | 303005 | 4.9% |
| R | 298128 | 4.8% |
| H | 265580 | 4.3% |
| N | 154108 | 2.5% |
| T | 154108 | 2.5% |
| Other values (5) | 716997 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 6161817 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| S | 1415232 | |
| U | 1149318 | |
| I | 756195 | |
| Z | 490281 | 8.0% |
| M | 458865 | 7.4% |
| E | 303005 | 4.9% |
| R | 298128 | 4.8% |
| H | 265580 | 4.3% |
| N | 154108 | 2.5% |
| T | 154108 | 2.5% |
| Other values (5) | 716997 |
datasetName
Text
Constant 
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 18.0 MiB |
Length
| Max length | 19 |
|---|---|
| Median length | 19 |
| Mean length | 19 |
| Min length | 19 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | NMNH Extant Biology |
|---|---|
| 2nd row | NMNH Extant Biology |
| 3rd row | NMNH Extant Biology |
| 4th row | NMNH Extant Biology |
| 5th row | NMNH Extant Biology |
| Value | Count | Frequency (%) |
| nmnh | 2361473 | |
| extant | 2361473 | |
| biology | 2361473 |
Most occurring characters
| Value | Count | Frequency (%) |
| N | 4722946 | 10.5% |
| (space) | 4722946 | 10.5% |
| t | 4722946 | 10.5% |
| o | 4722946 | 10.5% |
| M | 2361473 | 5.3% |
| H | 2361473 | 5.3% |
| E | 2361473 | 5.3% |
| x | 2361473 | 5.3% |
| a | 2361473 | 5.3% |
| n | 2361473 | 5.3% |
| Other values (5) | 11807365 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 25976203 | |
| Uppercase Letter | 14168838 | |
| Space Separator | 4722946 | 10.5% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| t | 4722946 | |
| o | 4722946 | |
| x | 2361473 | |
| a | 2361473 | |
| n | 2361473 | |
| i | 2361473 | |
| l | 2361473 | |
| g | 2361473 | |
| y | 2361473 |
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 4722946 | |
| M | 2361473 | |
| H | 2361473 | |
| E | 2361473 | |
| B | 2361473 |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 4722946 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 40145041 | |
| Common | 4722946 | 10.5% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| N | 4722946 | |
| t | 4722946 | |
| o | 4722946 | |
| M | 2361473 | 5.9% |
| H | 2361473 | 5.9% |
| E | 2361473 | 5.9% |
| x | 2361473 | 5.9% |
| a | 2361473 | 5.9% |
| n | 2361473 | 5.9% |
| B | 2361473 | 5.9% |
| Other values (4) | 9445892 |
Common
| Value | Count | Frequency (%) |
| (space) | 4722946 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 44867987 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| N | 4722946 | 10.5% |
| (space) | 4722946 | 10.5% |
| t | 4722946 | 10.5% |
| o | 4722946 | 10.5% |
| M | 2361473 | 5.3% |
| H | 2361473 | 5.3% |
| E | 2361473 | 5.3% |
| x | 2361473 | 5.3% |
| a | 2361473 | 5.3% |
| n | 2361473 | 5.3% |
| Other values (5) | 11807365 |
basisOfRecord
Text
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 18.0 MiB |
Length
| Max length | 19 |
|---|---|
| Median length | 18 |
| Mean length | 18.00610509 |
| Min length | 17 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | PRESERVED_SPECIMEN |
|---|---|
| 2nd row | PRESERVED_SPECIMEN |
| 3rd row | PRESERVED_SPECIMEN |
| 4th row | PRESERVED_SPECIMEN |
| 5th row | PRESERVED_SPECIMEN |
| Value | Count | Frequency (%) |
| preserved_specimen | 2329878 | |
| machine_observation | 23006 | 1.0% |
| human_observation | 8589 | 0.4% |
Most occurring characters
| Value | Count | Frequency (%) |
| E | 11703991 | |
| R | 4691351 | |
| S | 4691351 | |
| P | 4659756 | 11.0% |
| N | 2393068 | 5.6% |
| I | 2384479 | 5.6% |
| _ | 2361473 | 5.6% |
| M | 2361473 | 5.6% |
| V | 2361473 | 5.6% |
| C | 2352884 | 5.5% |
| Other values (7) | 2559632 | 6.0% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 40159458 | |
| Connector Punctuation | 2361473 | 5.6% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| E | 11703991 | |
| R | 4691351 | |
| S | 4691351 | |
| P | 4659756 | 11.6% |
| N | 2393068 | 6.0% |
| I | 2384479 | 5.9% |
| M | 2361473 | 5.9% |
| V | 2361473 | 5.9% |
| C | 2352884 | 5.9% |
| D | 2329878 | 5.8% |
| Other values (6) | 229754 | 0.6% |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 2361473 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 40159458 | |
| Common | 2361473 | 5.6% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| E | 11703991 | |
| R | 4691351 | |
| S | 4691351 | |
| P | 4659756 | 11.6% |
| N | 2393068 | 6.0% |
| I | 2384479 | 5.9% |
| M | 2361473 | 5.9% |
| V | 2361473 | 5.9% |
| C | 2352884 | 5.9% |
| D | 2329878 | 5.8% |
| Other values (6) | 229754 | 0.6% |
Common
| Value | Count | Frequency (%) |
| _ | 2361473 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 42520931 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| E | 11703991 | |
| R | 4691351 | |
| S | 4691351 | |
| P | 4659756 | 11.0% |
| N | 2393068 | 5.6% |
| I | 2384479 | 5.6% |
| _ | 2361473 | 5.6% |
| M | 2361473 | 5.6% |
| V | 2361473 | 5.6% |
| C | 2352884 | 5.5% |
| Other values (7) | 2559632 | 6.0% |
occurrenceID
Text
Unique 
| Distinct | 2361473 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 18.0 MiB |
Length
| Max length | 63 |
|---|---|
| Median length | 63 |
| Mean length | 62.99999068 |
| Min length | 41 |
Unique
| Unique | 2361473 |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | http://n2t.net/ark:/65665/3c1d5cd1b-23f9-4aab-8cd8-011e6535be18 |
|---|---|
| 2nd row | http://n2t.net/ark:/65665/38212d138-cfcd-4363-8d3b-93b82afc1d4b |
| 3rd row | http://n2t.net/ark:/65665/3c1d69371-acc7-4c47-bc57-9d5ba7994267 |
| 4th row | http://n2t.net/ark:/65665/382140f93-30c1-4f26-bd0c-77d197d5ebc0 |
| 5th row | http://n2t.net/ark:/65665/3c1d814f8-bb57-4c37-a953-dd84b1c6415d |
| Value | Count | Frequency (%) |
| http://n2t.net/ark:/65665/3c1d5cd1b-23f9-4aab-8cd8-011e6535be18 | 1 | < 0.1% |
| http://n2t.net/ark:/65665/382a04cb2-f704-42e5-bba7-b8a5c5cb730e | 1 | < 0.1% |
| http://n2t.net/ark:/65665/3821e29b7-fb9d-454b-bb15-3423f912baa1 | 1 | < 0.1% |
| http://n2t.net/ark:/65665/3822c38fd-38fc-4edd-913d-06e17f9f83c5 | 1 | < 0.1% |
| http://n2t.net/ark:/65665/3c1db6db1-1cf4-4831-a73a-6e62a01a92ec | 1 | < 0.1% |
| http://n2t.net/ark:/65665/3c1d69371-acc7-4c47-bc57-9d5ba7994267 | 1 | < 0.1% |
| http://n2t.net/ark:/65665/382140f93-30c1-4f26-bd0c-77d197d5ebc0 | 1 | < 0.1% |
| http://n2t.net/ark:/65665/3c1d814f8-bb57-4c37-a953-dd84b1c6415d | 1 | < 0.1% |
| http://n2t.net/ark:/65665/38215186e-af4f-46dc-8b81-ec58617bdfd7 | 1 | < 0.1% |
| http://n2t.net/ark:/65665/3c1d9c4a8-7ba7-48dd-b92e-9924960b16d2 | 1 | < 0.1% |
| Other values (2361463) | 2361463 |
Most occurring characters
| Value | Count | Frequency (%) |
| / | 11807365 | 7.9% |
| 6 | 11516266 | 7.7% |
| t | 9445892 | 6.3% |
| - | 9445890 | 6.3% |
| 5 | 9149800 | 6.2% |
| a | 7384461 | 5.0% |
| 2 | 6789572 | 4.6% |
| 3 | 6788565 | 4.6% |
| 4 | 6786988 | 4.6% |
| e | 6782983 | 4.6% |
| Other values (16) | 62874995 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 64357740 | |
| Lowercase Letter | 56077363 | |
| Other Punctuation | 18891784 | 12.7% |
| Dash Punctuation | 9445890 | 6.3% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| t | 9445892 | |
| a | 7384461 | |
| e | 6782983 | |
| b | 5018118 | |
| n | 4722946 | |
| c | 4431867 | |
| d | 4426436 | |
| f | 4418768 | |
| k | 2361473 | 4.2% |
| r | 2361473 | 4.2% |
| Other values (2) | 4722946 |
Decimal Number
| Value | Count | Frequency (%) |
| 6 | 11516266 | |
| 5 | 9149800 | |
| 2 | 6789572 | |
| 3 | 6788565 | |
| 4 | 6786988 | |
| 9 | 5027535 | |
| 8 | 5024119 | |
| 7 | 4432331 | 6.9% |
| 1 | 4425192 | 6.9% |
| 0 | 4417372 | 6.9% |
Other Punctuation
| Value | Count | Frequency (%) |
| / | 11807365 | |
| : | 4722946 | 25.0% |
| . | 2361473 | 12.5% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 9445890 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 92695414 | |
| Latin | 56077363 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| / | 11807365 | |
| 6 | 11516266 | |
| - | 9445890 | |
| 5 | 9149800 | |
| 2 | 6789572 | |
| 3 | 6788565 | |
| 4 | 6786988 | |
| 9 | 5027535 | 5.4% |
| 8 | 5024119 | 5.4% |
| : | 4722946 | 5.1% |
| Other values (4) | 15636368 |
Latin
| Value | Count | Frequency (%) |
| t | 9445892 | |
| a | 7384461 | |
| e | 6782983 | |
| b | 5018118 | |
| n | 4722946 | |
| c | 4431867 | |
| d | 4426436 | |
| f | 4418768 | |
| k | 2361473 | 4.2% |
| r | 2361473 | 4.2% |
| Other values (2) | 4722946 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 148772777 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| / | 11807365 | 7.9% |
| 6 | 11516266 | 7.7% |
| t | 9445892 | 6.3% |
| - | 9445890 | 6.3% |
| 5 | 9149800 | 6.2% |
| a | 7384461 | 5.0% |
| 2 | 6789572 | 4.6% |
| 3 | 6788565 | 4.6% |
| 4 | 6786988 | 4.6% |
| e | 6782983 | 4.6% |
| Other values (16) | 62874995 |
catalogNumber
Text
Missing 
| Distinct | 1790648 |
|---|---|
| Distinct (%) | 83.4% |
| Missing | 213212 |
| Missing (%) | 9.0% |
| Memory size | 18.0 MiB |
Length
| Max length | 22 |
|---|---|
| Median length | 21 |
| Mean length | 10.54068477 |
| Min length | 4 |
Unique
| Unique | 1533820 |
|---|---|
| Unique (%) | 71.4% |
Sample
| 1st row | USNM 1220020 |
|---|---|
| 2nd row | US 2327562 |
| 3rd row | USNM 359728 |
| 4th row | USNM 65866 |
| 5th row | USNM 1569732 |
| Value | Count | Frequency (%) |
| usnm | 1056890 | |
| us | 984436 | |
| herp | 1447 | < 0.1% |
| tissue | 1416 | < 0.1% |
| sem | 65 | < 0.1% |
| | 48 | < 0.1% |
| 1 | 41 | < 0.1% |
| stub | 40 | < 0.1% |
| image | 31 | < 0.1% |
| micrograph | 25 | < 0.1% |
| Other values (1602736) | 2148327 |
Most occurring characters
| Value | Count | Frequency (%) |
| S | 2152376 | 9.5% |
| U | 2147427 | 9.5% |
| (space) | 2044505 | 9.0% |
| 1 | 1805413 | 8.0% |
| 2 | 1634081 | 7.2% |
| 3 | 1537039 | 6.8% |
| 0 | 1310826 | 5.8% |
| 4 | 1307100 | 5.8% |
| 5 | 1283378 | 5.7% |
| N | 1249869 | 5.5% |
| Other values (57) | 6172128 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 13582232 | |
| Uppercase Letter | 6983762 | |
| Space Separator | 2044505 | 9.0% |
| Lowercase Letter | 25990 | 0.1% |
| Dash Punctuation | 5780 | < 0.1% |
| Other Punctuation | 1853 | < 0.1% |
| Close Punctuation | 10 | < 0.1% |
| Open Punctuation | 10 | < 0.1% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 2152376 | |
| U | 2147427 | |
| N | 1249869 | |
| M | 1159926 | |
| E | 112070 | 1.6% |
| T | 100478 | 1.4% |
| A | 17145 | 0.2% |
| D | 16761 | 0.2% |
| R | 11166 | 0.2% |
| B | 8984 | 0.1% |
| Other values (15) | 7560 | 0.1% |
Lowercase Letter
| Value | Count | Frequency (%) |
| w | 11053 | |
| e | 2937 | 11.3% |
| s | 2832 | 10.9% |
| a | 2180 | 8.4% |
| r | 1498 | 5.8% |
| p | 1473 | 5.7% |
| u | 1466 | 5.6% |
| i | 1445 | 5.6% |
| b | 476 | 1.8% |
| c | 186 | 0.7% |
| Other values (15) | 444 | 1.7% |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 1805413 | |
| 2 | 1634081 | |
| 3 | 1537039 | |
| 0 | 1310826 | |
| 4 | 1307100 | |
| 5 | 1283378 | |
| 6 | 1213835 | |
| 7 | 1186666 | |
| 8 | 1170155 | |
| 9 | 1133739 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 1056 | |
| * | 796 | |
| ? | 1 | 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 2044505 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 5780 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 10 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 10 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 15634390 | |
| Latin | 7009752 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| S | 2152376 | |
| U | 2147427 | |
| N | 1249869 | |
| M | 1159926 | |
| E | 112070 | 1.6% |
| T | 100478 | 1.4% |
| A | 17145 | 0.2% |
| D | 16761 | 0.2% |
| R | 11166 | 0.2% |
| w | 11053 | 0.2% |
| Other values (40) | 31481 | 0.4% |
Common
| Value | Count | Frequency (%) |
| (space) | 2044505 | |
| 1 | 1805413 | |
| 2 | 1634081 | |
| 3 | 1537039 | |
| 0 | 1310826 | |
| 4 | 1307100 | |
| 5 | 1283378 | |
| 6 | 1213835 | |
| 7 | 1186666 | |
| 8 | 1170155 | |
| Other values (7) | 1141392 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 22644142 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| S | 2152376 | 9.5% |
| U | 2147427 | 9.5% |
| (space) | 2044505 | 9.0% |
| 1 | 1805413 | 8.0% |
| 2 | 1634081 | 7.2% |
| 3 | 1537039 | 6.8% |
| 0 | 1310826 | 5.8% |
| 4 | 1307100 | 5.8% |
| 5 | 1283378 | 5.7% |
| N | 1249869 | 5.5% |
| Other values (57) | 6172128 |
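The Distinct and Unique rows for catalogNumber above differ (1790648 versus 1533820), which suggests that Distinct counts every different value while Unique counts only values that occur exactly once. The percentages shown are consistent with dividing by the non-missing rows (2361473 − 213212). The following is a minimal sketch of that reading using only the Python standard library; the function name `distinct_and_unique` and the toy values are hypothetical, and this is not the report generator's own code.

```python
from collections import Counter


def distinct_and_unique(values):
    """Return (n_distinct, n_unique, distinct_pct, unique_pct).

    Assumption: missing values are represented as None and are excluded
    from both the counts and the percentage denominator.
    """
    present = [v for v in values if v is not None]
    counts = Counter(present)
    n_distinct = len(counts)                                # different values seen
    n_unique = sum(1 for c in counts.values() if c == 1)    # values seen exactly once
    total = len(present)
    if total == 0:
        return 0, 0, 0.0, 0.0
    return n_distinct, n_unique, 100 * n_distinct / total, 100 * n_unique / total


# Toy, made-up column (not taken from the dataset).
sample = ["USNM 1", "USNM 1", "US 2", "USNM 3", None]
print(distinct_and_unique(sample))  # (3, 2, 75.0, 50.0)
```

Applied to the figures reported for catalogNumber, the same arithmetic gives 1790648 / 2148261 ≈ 83.4% distinct and 1533820 / 2148261 ≈ 71.4% unique, matching the table.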
recordNumber
Text
Missing 
| Distinct | 253149 |
|---|---|
| Distinct (%) | 19.2% |
| Missing | 1045439 |
| Missing (%) | 44.3% |
| Memory size | 18.0 MiB |
Length
| Max length | 93 |
|---|---|
| Median length | 90 |
| Mean length | 4.785216035 |
| Min length | 1 |
Unique
| Unique | 197322 |
|---|---|
| Unique (%) | 15.0% |
Sample
| 1st row | 5209 |
|---|---|
| 2nd row | USNPC # 008843 |
| 3rd row | USNPC # 074963 |
| 4th row | 478 |
| 5th row | s.n. |
| Value | Count | Frequency (%) |
| s.n | 164138 | 11.2% |
| | 26102 | 1.8% |
| usnpc | 22710 | 1.5% |
| no | 12214 | 0.8% |
| number | 11997 | 0.8% |
| bureau | 5232 | 0.4% |
| eyd | 4047 | 0.3% |
| s | 3600 | 0.2% |
| of | 3507 | 0.2% |
| n | 3489 | 0.2% |
| Other values (191948) | 1214297 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 716221 | |
| 2 | 556077 | 8.8% |
| 3 | 479210 | 7.6% |
| 0 | 459758 | 7.3% |
| 4 | 448405 | 7.1% |
| 5 | 430657 | 6.8% |
| 6 | 416990 | 6.6% |
| 7 | 393873 | 6.3% |
| 8 | 377941 | 6.0% |
| 9 | 367752 | 5.8% |
| Other values (94) | 1650623 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 4646884 | |
| Lowercase Letter | 543523 | 8.6% |
| Uppercase Letter | 454883 | 7.2% |
| Other Punctuation | 398023 | 6.3% |
| Space Separator | 155299 | 2.5% |
| Dash Punctuation | 89908 | 1.4% |
| Connector Punctuation | 3813 | 0.1% |
| Close Punctuation | 2297 | < 0.1% |
| Open Punctuation | 2296 | < 0.1% |
| Other Number | 408 | < 0.1% |
| Other values (2) | 173 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| n | 175707 | |
| s | 172005 | |
| e | 30020 | 5.5% |
| u | 24493 | 4.5% |
| r | 24320 | 4.5% |
| o | 21858 | 4.0% |
| a | 19196 | 3.5% |
| b | 18006 | 3.3% |
| m | 13239 | 2.4% |
| c | 10258 | 1.9% |
| Other values (23) | 34421 | 6.3% |
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 58959 | |
| S | 46028 | 10.1% |
| C | 37900 | 8.3% |
| P | 36208 | 8.0% |
| U | 27138 | 6.0% |
| B | 24770 | 5.4% |
| A | 23606 | 5.2% |
| H | 20043 | 4.4% |
| D | 18973 | 4.2% |
| L | 18356 | 4.0% |
| Other values (18) | 142902 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 347774 | |
| # | 22903 | 5.8% |
| / | 12068 | 3.0% |
| & | 6380 | 1.6% |
| * | 3583 | 0.9% |
| ? | 2718 | 0.7% |
| , | 1563 | 0.4% |
| ! | 632 | 0.2% |
| : | 243 | 0.1% |
| ; | 104 | < 0.1% |
| Other values (3) | 55 | < 0.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 716221 | |
| 2 | 556077 | |
| 3 | 479210 | |
| 0 | 459758 | |
| 4 | 448405 | |
| 5 | 430657 | |
| 6 | 416990 | |
| 7 | 393873 | |
| 8 | 377941 | |
| 9 | 367752 |
Other Number
| Value | Count | Frequency (%) |
| ½ | 393 | |
| ² | 6 | 1.5% |
| ¼ | 4 | 1.0% |
| ³ | 2 | 0.5% |
| ¾ | 2 | 0.5% |
| ⅓ | 1 | 0.2% |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 2122 | |
| ] | 114 | 5.0% |
| } | 61 | 2.7% |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 2121 | |
| [ | 114 | 5.0% |
| { | 61 | 2.7% |
Math Symbol
| Value | Count | Frequency (%) |
| = | 130 | |
| + | 40 | 23.3% |
| ~ | 2 | 1.2% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 89907 | |
| – | 1 | < 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 155299 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 3813 |
Other Symbol
| Value | Count | Frequency (%) |
| ° | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 5299101 | |
| Latin | 998406 | 15.9% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| n | 175707 | |
| s | 172005 | |
| N | 58959 | 5.9% |
| S | 46028 | 4.6% |
| C | 37900 | 3.8% |
| P | 36208 | 3.6% |
| e | 30020 | 3.0% |
| U | 27138 | 2.7% |
| B | 24770 | 2.5% |
| u | 24493 | 2.5% |
| Other values (51) | 365178 |
Common
| Value | Count | Frequency (%) |
| 1 | 716221 | |
| 2 | 556077 | |
| 3 | 479210 | |
| 0 | 459758 | |
| 4 | 448405 | |
| 5 | 430657 | |
| 6 | 416990 | |
| 7 | 393873 | |
| 8 | 377941 | |
| 9 | 367752 | |
| Other values (33) | 652217 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 6297080 | |
| None | 425 | < 0.1% |
| Punctuation | 1 | < 0.1% |
| Number Forms | 1 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 716221 | |
| 2 | 556077 | 8.8% |
| 3 | 479210 | 7.6% |
| 0 | 459758 | 7.3% |
| 4 | 448405 | 7.1% |
| 5 | 430657 | 6.8% |
| 6 | 416990 | 6.6% |
| 7 | 393873 | 6.3% |
| 8 | 377941 | 6.0% |
| 9 | 367752 | 5.8% |
| Other values (77) | 1650196 |
None
| Value | Count | Frequency (%) |
| ½ | 393 | |
| ² | 6 | 1.4% |
| è | 5 | 1.2% |
| ¼ | 4 | 0.9% |
| é | 3 | 0.7% |
| á | 3 | 0.7% |
| ³ | 2 | 0.5% |
| ¾ | 2 | 0.5% |
| Ʃ | 1 | 0.2% |
| ó | 1 | 0.2% |
| Other values (5) | 5 | 1.2% |
Punctuation
| Value | Count | Frequency (%) |
| – | 1 |
Number Forms
| Value | Count | Frequency (%) |
| ⅓ | 1 |
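The category tables for recordNumber above (Decimal Number, Other Number, Dash Punctuation, and so on) correspond to Unicode general categories. As a rough illustration of how such a tally can be produced, the sketch below uses Python's standard `unicodedata.category`, which returns two-letter codes such as `Nd` or `No`. The helper `category_counts`, the partial `LONG_NAMES` mapping, and the value `"12½"` are assumptions made for this example; the script and block tables need data that the standard library does not expose and are not covered here.

```python
import unicodedata
from collections import Counter

# Partial mapping from Unicode general-category codes to the long names
# used in the tables above (only the codes needed for this example).
LONG_NAMES = {
    "Lu": "Uppercase Letter",
    "Ll": "Lowercase Letter",
    "Nd": "Decimal Number",
    "No": "Other Number",
    "Po": "Other Punctuation",
    "Zs": "Space Separator",
}


def category_counts(strings):
    """Count characters per Unicode general category across all strings."""
    counts = Counter()
    for s in strings:
        for ch in s:
            counts[unicodedata.category(ch)] += 1
    return counts


# The first two values appear in the Sample table; "12½" is made up.
sample = ["USNPC # 008843", "s.n.", "12½"]
counts = category_counts(sample)
for code, n in counts.most_common():
    print(LONG_NAMES.get(code, code), n)
```

On this sample the fraction character `½` falls under Other Number rather than Decimal Number, which is why the report lists it in a separate table from the digits 0 to 9.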
recordedBy
Text
Missing 
| Distinct | 115852 |
|---|---|
| Distinct (%) | 6.2% |
| Missing | 498671 |
| Missing (%) | 21.1% |
| Memory size | 18.0 MiB |
Length
| Max length | 239 |
|---|---|
| Median length | 171 |
| Mean length | 17.19955529 |
| Min length | 1 |
Unique
| Unique | 56989 |
|---|---|
| Unique (%) | 3.1% |
Sample
| 1st row | G. Hendler |
|---|---|
| 2nd row | R. C. Rollins & D. Rollins |
| 3rd row | T. Vaughan |
| 4th row | D. Harper |
| 5th row | F. Harvey |
| Value | Count | Frequency (%) |
| | 413362 | 6.3% |
| j | 303894 | 4.6% |
| a | 242831 | 3.7% |
| r | 228710 | 3.5% |
| e | 216487 | 3.3% |
| c | 207629 | 3.2% |
| m | 197349 | 3.0% |
| h | 179143 | 2.7% |
| w | 156602 | 2.4% |
| l | 143933 | 2.2% |
| Other values (44536) | 4249869 |
Most occurring characters
| Value | Count | Frequency (%) |
| (space) | 4677007 | 14.6% |
| . | 2978690 | 9.3% |
| e | 2217324 | 6.9% |
| a | 1603560 | 5.0% |
| r | 1548210 | 4.8% |
| n | 1462460 | 4.6% |
| o | 1456032 | 4.5% |
| i | 1334932 | 4.2% |
| l | 1157612 | 3.6% |
| t | 1153395 | 3.6% |
| Other values (133) | 12450144 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 17261749 | |
| Uppercase Letter | 6291902 | 19.6% |
| Space Separator | 4677007 | 14.6% |
| Other Punctuation | 3656578 | 11.4% |
| Dash Punctuation | 103346 | 0.3% |
| Close Punctuation | 21592 | 0.1% |
| Open Punctuation | 21572 | 0.1% |
| Decimal Number | 5575 | < 0.1% |
| Math Symbol | 30 | < 0.1% |
| Modifier Symbol | 9 | < 0.1% |
| Other values (3) | 6 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 2217324 | |
| a | 1603560 | |
| r | 1548210 | |
| n | 1462460 | 8.5% |
| o | 1456032 | 8.4% |
| i | 1334932 | 7.7% |
| l | 1157612 | 6.7% |
| t | 1153395 | 6.7% |
| s | 1030523 | 6.0% |
| h | 517272 | 3.0% |
| Other values (59) | 3780429 |
Uppercase Letter
| Value | Count | Frequency (%) |
| M | 576187 | 9.2% |
| S | 551297 | 8.8% |
| C | 480161 | 7.6% |
| R | 395841 | 6.3% |
| H | 394114 | 6.3% |
| B | 381832 | 6.1% |
| J | 365707 | 5.8% |
| A | 357987 | 5.7% |
| L | 337942 | 5.4% |
| W | 302701 | 4.8% |
| Other values (31) | 2148133 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 2978690 | |
| & | 368061 | 10.1% |
| , | 243814 | 6.7% |
| / | 61528 | 1.7% |
| ' | 4009 | 0.1% |
| " | 424 | < 0.1% |
| ? | 27 | < 0.1% |
| : | 17 | < 0.1% |
| ; | 5 | < 0.1% |
| # | 2 | < 0.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 1357 | |
| 9 | 1116 | |
| 8 | 933 | |
| 0 | 693 | |
| 3 | 373 | 6.7% |
| 4 | 368 | 6.6% |
| 2 | 338 | 6.1% |
| 5 | 306 | 5.5% |
| 7 | 54 | 1.0% |
| 6 | 37 | 0.7% |
Open Punctuation
| Value | Count | Frequency (%) |
| [ | 16664 | |
| ( | 4908 | 22.8% |
Close Punctuation
| Value | Count | Frequency (%) |
| ] | 16662 | |
| ) | 4930 | 22.8% |
Math Symbol
| Value | Count | Frequency (%) |
| = | 26 | |
| + | 4 | 13.3% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 4677007 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 103346 |
Modifier Symbol
| Value | Count | Frequency (%) |
| ´ | 9 |
Other Symbol
| Value | Count | Frequency (%) |
| ° | 4 |
Final Punctuation
| Value | Count | Frequency (%) |
| » | 1 |
Initial Punctuation
| Value | Count | Frequency (%) |
| « | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 23553649 | |
| Common | 8485715 | 26.5% |
| Cyrillic | 1 | < 0.1% |
| Greek | 1 | < 0.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 2217324 | 9.4% |
| a | 1603560 | 6.8% |
| r | 1548210 | 6.6% |
| n | 1462460 | 6.2% |
| o | 1456032 | 6.2% |
| i | 1334932 | 5.7% |
| l | 1157612 | 4.9% |
| t | 1153395 | 4.9% |
| s | 1030523 | 4.4% |
| M | 576187 | 2.4% |
| Other values (98) | 10013414 |
Common
| Value | Count | Frequency (%) |
| (space) | 4677007 | |
| . | 2978690 | |
| & | 368061 | 4.3% |
| , | 243814 | 2.9% |
| - | 103346 | 1.2% |
| / | 61528 | 0.7% |
| [ | 16664 | 0.2% |
| ] | 16662 | 0.2% |
| ) | 4930 | 0.1% |
| ( | 4908 | 0.1% |
| Other values (23) | 10105 | 0.1% |
Cyrillic
| Value | Count | Frequency (%) |
| Ӧ | 1 |
Greek
| Value | Count | Frequency (%) |
| β | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 31975557 | |
| None | 63807 | 0.2% |
| IPA Ext | 1 | < 0.1% |
| Cyrillic | 1 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| (space) | 4677007 | 14.6% |
| . | 2978690 | 9.3% |
| e | 2217324 | 6.9% |
| a | 1603560 | 5.0% |
| r | 1548210 | 4.8% |
| n | 1462460 | 4.6% |
| o | 1456032 | 4.6% |
| i | 1334932 | 4.2% |
| l | 1157612 | 3.6% |
| t | 1153395 | 3.6% |
| Other values (70) | 12386335 |
None
| Value | Count | Frequency (%) |
| é | 10928 | |
| á | 10904 | |
| ó | 9861 | |
| í | 7315 | |
| ñ | 6341 | |
| è | 4437 | |
| ü | 3492 | 5.5% |
| ö | 2715 | 4.3% |
| ê | 1774 | 2.8% |
| ç | 790 | 1.2% |
| Other values (51) | 5250 |
IPA Ext
| Value | Count | Frequency (%) |
| ɶ | 1 |
Cyrillic
| Value | Count | Frequency (%) |
| Ӧ | 1 |
individualCount
Text
| Distinct | 819 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 1023 |
| Missing (%) | < 0.1% |
| Memory size | 18.0 MiB |
Length
| Max length | 5 |
|---|---|
| Median length | 1 |
| Mean length | 1.031862145 |
| Min length | 1 |
Unique
| Unique | 338 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | 31 |
|---|---|
| 2nd row | 1 |
| 3rd row | 1 |
| 4th row | 4 |
| 5th row | 1 |
| Value | Count | Frequency (%) |
| 1 | 2047199 | |
| 2 | 94185 | 4.0% |
| 3 | 46375 | 2.0% |
| 4 | 33192 | 1.4% |
| 5 | 24016 | 1.0% |
| 6 | 16667 | 0.7% |
| 10 | 12127 | 0.5% |
| 7 | 10535 | 0.4% |
| 8 | 9696 | 0.4% |
| 9 | 6408 | 0.3% |
| Other values (809) | 60050 | 2.5% |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 2094601 | |
| 2 | 114742 | 4.7% |
| 3 | 56960 | 2.3% |
| 4 | 41191 | 1.7% |
| 5 | 36721 | 1.5% |
| 0 | 30807 | 1.3% |
| 6 | 21946 | 0.9% |
| 7 | 15204 | 0.6% |
| 8 | 13757 | 0.6% |
| 9 | 9730 | 0.4% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 2435659 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 2094601 | |
| 2 | 114742 | 4.7% |
| 3 | 56960 | 2.3% |
| 4 | 41191 | 1.7% |
| 5 | 36721 | 1.5% |
| 0 | 30807 | 1.3% |
| 6 | 21946 | 0.9% |
| 7 | 15204 | 0.6% |
| 8 | 13757 | 0.6% |
| 9 | 9730 | 0.4% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 2435659 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 2094601 | |
| 2 | 114742 | 4.7% |
| 3 | 56960 | 2.3% |
| 4 | 41191 | 1.7% |
| 5 | 36721 | 1.5% |
| 0 | 30807 | 1.3% |
| 6 | 21946 | 0.9% |
| 7 | 15204 | 0.6% |
| 8 | 13757 | 0.6% |
| 9 | 9730 | 0.4% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2435659 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 2094601 | |
| 2 | 114742 | 4.7% |
| 3 | 56960 | 2.3% |
| 4 | 41191 | 1.7% |
| 5 | 36721 | 1.5% |
| 0 | 30807 | 1.3% |
| 6 | 21946 | 0.9% |
| 7 | 15204 | 0.6% |
| 8 | 13757 | 0.6% |
| 9 | 9730 | 0.4% |
sex
Text
Missing 
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 2009611 |
| Missing (%) | 85.1% |
| Memory size | 18.0 MiB |
Length
| Max length | 13 |
|---|---|
| Median length | 4 |
| Mean length | 4.89924459 |
| Min length | 4 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | FEMALE |
|---|---|
| 2nd row | MALE |
| 3rd row | MALE |
| 4th row | MALE |
| 5th row | MALE |
| Value | Count | Frequency (%) |
| male | 193937 | |
| female | 157845 | |
| hermaphrodite | 80 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| E | 509787 | |
| M | 351862 | |
| A | 351862 | |
| L | 351782 | |
| F | 157845 | 9.2% |
| H | 160 | < 0.1% |
| R | 160 | < 0.1% |
| P | 80 | < 0.1% |
| O | 80 | < 0.1% |
| D | 80 | < 0.1% |
| Other values (2) | 160 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 1723858 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| E | 509787 | |
| M | 351862 | |
| A | 351862 | |
| L | 351782 | |
| F | 157845 | 9.2% |
| H | 160 | < 0.1% |
| R | 160 | < 0.1% |
| P | 80 | < 0.1% |
| O | 80 | < 0.1% |
| D | 80 | < 0.1% |
| Other values (2) | 160 | < 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1723858 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| E | 509787 | |
| M | 351862 | |
| A | 351862 | |
| L | 351782 | |
| F | 157845 | 9.2% |
| H | 160 | < 0.1% |
| R | 160 | < 0.1% |
| P | 80 | < 0.1% |
| O | 80 | < 0.1% |
| D | 80 | < 0.1% |
| Other values (2) | 160 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1723858 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| E | 509787 | |
| M | 351862 | |
| A | 351862 | |
| L | 351782 | |
| F | 157845 | 9.2% |
| H | 160 | < 0.1% |
| R | 160 | < 0.1% |
| P | 80 | < 0.1% |
| O | 80 | < 0.1% |
| D | 80 | < 0.1% |
| Other values (2) | 160 | < 0.1% |
lifeStage
Text
Missing 
| Distinct | 30 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 2107148 |
| Missing (%) | 89.2% |
| Memory size | 18.0 MiB |
Length
| Max length | 10 |
|---|---|
| Median length | 5 |
| Mean length | 6.528029097 |
| Min length | 3 |
Unique
| Unique | 2 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Adult |
|---|---|
| 2nd row | Adult |
| 3rd row | Adult |
| 4th row | Fruiting |
| 5th row | Flowering |
| Value | Count | Frequency (%) |
| adult | 137714 | |
| flowering | 49923 | 19.6% |
| fruiting | 26364 | 10.4% |
| juvenile | 15425 | 6.1% |
| immature | 8914 | 3.5% |
| vegetative | 6008 | 2.4% |
| larva | 5218 | 2.1% |
| subadult | 1137 | 0.4% |
| chick | 960 | 0.4% |
| embryo | 589 | 0.2% |
| Other values (20) | 2073 | 0.8% |
Most occurring characters
| Value | Count | Frequency (%) |
| l | 204968 | |
| u | 191207 | |
| t | 187477 | |
| d | 138859 | |
| A | 137714 | |
| i | 125782 | |
| e | 108742 | 6.5% |
| n | 93053 | 5.6% |
| r | 91120 | 5.5% |
| g | 83572 | 5.0% |
| Other values (29) | 297747 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 1405916 | |
| Uppercase Letter | 254325 | 15.3% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| l | 204968 | |
| u | 191207 | |
| t | 187477 | |
| d | 138859 | |
| i | 125782 | |
| e | 108742 | |
| n | 93053 | |
| r | 91120 | |
| g | 83572 | |
| o | 50963 | 3.6% |
| Other values (12) | 130173 |
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 137714 | |
| F | 76458 | |
| J | 15425 | 6.1% |
| I | 8914 | 3.5% |
| V | 6031 | 2.4% |
| L | 5218 | 2.1% |
| S | 1138 | 0.4% |
| C | 963 | 0.4% |
| E | 950 | 0.4% |
| H | 575 | 0.2% |
| Other values (7) | 939 | 0.4% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1660241 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| l | 204968 | |
| u | 191207 | |
| t | 187477 | |
| d | 138859 | |
| A | 137714 | |
| i | 125782 | |
| e | 108742 | 6.5% |
| n | 93053 | 5.6% |
| r | 91120 | 5.5% |
| g | 83572 | 5.0% |
| Other values (29) | 297747 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1660241 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| l | 204968 | |
| u | 191207 | |
| t | 187477 | |
| d | 138859 | |
| A | 137714 | |
| i | 125782 | |
| e | 108742 | 6.5% |
| n | 93053 | 5.6% |
| r | 91120 | 5.5% |
| g | 83572 | 5.0% |
| Other values (29) | 297747 |
occurrenceStatus
Text
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 1 |
| Missing (%) | < 0.1% |
| Memory size | 18.0 MiB |
Length
| Max length | 7 |
|---|---|
| Median length | 7 |
| Mean length | 6.999555786 |
| Min length | 6 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | PRESENT |
|---|---|
| 2nd row | PRESENT |
| 3rd row | PRESENT |
| 4th row | PRESENT |
| 5th row | PRESENT |
| Value | Count | Frequency (%) |
| present | 2360423 | |
| absent | 1049 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| E | 4721895 | |
| S | 2361472 | |
| N | 2361472 | |
| T | 2361472 | |
| P | 2360423 | |
| R | 2360423 | |
| A | 1049 | < 0.1% |
| B | 1049 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 16529255 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| E | 4721895 | |
| S | 2361472 | |
| N | 2361472 | |
| T | 2361472 | |
| P | 2360423 | |
| R | 2360423 | |
| A | 1049 | < 0.1% |
| B | 1049 | < 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 16529255 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| E | 4721895 | |
| S | 2361472 | |
| N | 2361472 | |
| T | 2361472 | |
| P | 2360423 | |
| R | 2360423 | |
| A | 1049 | < 0.1% |
| B | 1049 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 16529255 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| E | 4721895 | |
| S | 2361472 | |
| N | 2361472 | |
| T | 2361472 | |
| P | 2360423 | |
| R | 2360423 | |
| A | 1049 | < 0.1% |
| B | 1049 | < 0.1% |
preparations
Text
Missing 
| Distinct | 1125 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 1223408 |
| Missing (%) | 51.8% |
| Memory size | 18.0 MiB |
Length
| Max length | 192 |
|---|---|
| Median length | 154 |
| Mean length | 9.646168716 |
| Min length | 3 |
Unique
| Unique | 452 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Alcohol (Ethanol) |
|---|---|
| 2nd row | Ethanol |
| 3rd row | Dry |
| 4th row | Alcohol (Ethanol) |
| 5th row | Pinned |
| Value | Count | Frequency (%) |
| ethanol | 373474 | |
| dry | 234832 | |
| alcohol | 228646 | |
| skin | 213511 | |
| whole | 136729 | 8.0% |
| skull | 114952 | 6.7% |
| pinned | 99259 | 5.8% |
| slide | 49718 | 2.9% |
| fluid | 34184 | 2.0% |
| envelope | 29335 | 1.7% |
| Other values (239) | 199700 |
Most occurring characters
| Value | Count | Frequency (%) |
| l | 1411530 | 12.9% |
| o | 1133314 | 10.3% |
| n | 892191 | 8.1% |
| h | 772057 | 7.0% |
| (space) | 576275 | 5.2% |
| i | 492145 | 4.5% |
| e | 477125 | 4.3% |
| a | 473893 | 4.3% |
| t | 460115 | 4.2% |
| S | 424074 | 3.9% |
| Other values (64) | 3865248 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 7927169 | |
| Uppercase Letter | 1683886 | 15.3% |
| Space Separator | 576275 | 5.2% |
| Other Punctuation | 285737 | 2.6% |
| Open Punctuation | 244959 | 2.2% |
| Close Punctuation | 244959 | 2.2% |
| Decimal Number | 10059 | 0.1% |
| Dash Punctuation | 4923 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| l | 1411530 | |
| o | 1133314 | |
| n | 892191 | |
| h | 772057 | |
| i | 492145 | 6.2% |
| e | 477125 | 6.0% |
| a | 473893 | 6.0% |
| t | 460115 | 5.8% |
| k | 359224 | 4.5% |
| r | 316217 | 4.0% |
| Other values (16) | 1139358 |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 424074 | |
| E | 414250 | |
| D | 235606 | |
| A | 232250 | |
| W | 147923 | 8.8% |
| P | 120049 | 7.1% |
| F | 44864 | 2.7% |
| M | 15691 | 0.9% |
| B | 7803 | 0.5% |
| L | 6693 | 0.4% |
| Other values (15) | 34683 | 2.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 9 | 4647 | |
| 5 | 4573 | |
| 0 | 456 | 4.5% |
| 8 | 255 | 2.5% |
| 7 | 107 | 1.1% |
| 2 | 9 | 0.1% |
| 1 | 9 | 0.1% |
| 3 | 2 | < 0.1% |
| 6 | 1 | < 0.1% |
Other Punctuation
| Value | Count | Frequency (%) |
| : | 143613 | |
| ; | 135363 | |
| % | 5018 | 1.8% |
| & | 836 | 0.3% |
| / | 806 | 0.3% |
| . | 62 | < 0.1% |
| , | 37 | < 0.1% |
| ? | 2 | < 0.1% |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 243972 | |
| [ | 987 | 0.4% |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 243972 | |
| ] | 987 | 0.4% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 576275 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 4923 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 9611055 | |
| Common | 1366912 | 12.5% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| l | 1411530 | |
| o | 1133314 | |
| n | 892191 | 9.3% |
| h | 772057 | 8.0% |
| i | 492145 | 5.1% |
| e | 477125 | 5.0% |
| a | 473893 | 4.9% |
| t | 460115 | 4.8% |
| S | 424074 | 4.4% |
| E | 414250 | 4.3% |
| Other values (41) | 2660361 |
Common
| Value | Count | Frequency (%) |
| (space) | 576275 | |
| ( | 243972 | |
| ) | 243972 | |
| : | 143613 | 10.5% |
| ; | 135363 | 9.9% |
| % | 5018 | 0.4% |
| - | 4923 | 0.4% |
| 9 | 4647 | 0.3% |
| 5 | 4573 | 0.3% |
| ] | 987 | 0.1% |
| Other values (13) | 3569 | 0.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 10977967 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| l | 1411530 | 12.9% |
| o | 1133314 | 10.3% |
| n | 892191 | 8.1% |
| h | 772057 | 7.0% |
| (space) | 576275 | 5.2% |
| i | 492145 | 4.5% |
| e | 477125 | 4.3% |
| a | 473893 | 4.3% |
| t | 460115 | 4.2% |
| S | 424074 | 3.9% |
| Other values (64) | 3865248 |
Missing 
| Distinct | 3083 |
|---|---|
| Distinct (%) | 99.4% |
| Missing | 2358372 |
| Missing (%) | 99.9% |
| Memory size | 18.0 MiB |
Length
| Max length | 12558 |
|---|---|
| Median length | 49 |
| Mean length | 106.4991938 |
| Min length | 47 |
Unique
| Unique | 3075 |
|---|---|
| Unique (%) | 99.2% |
Sample
| 1st row | https://www.ncbi.nlm.nih.gov/gquery?term=KM080038 |
|---|---|
| 2nd row | https://www.ncbi.nlm.nih.gov/gquery?term=EU823242;https://www.ncbi.nlm.nih.gov/gquery?term=EU823167;https://www.ncbi.nlm.nih.gov/gquery?term=KC246618 |
| 3rd row | https://www.ncbi.nlm.nih.gov/gquery?term=MN549733 |
| 4th row | https://www.ncbi.nlm.nih.gov/gquery?term=KC771789;https://www.ncbi.nlm.nih.gov/gquery?term=KC771632 |
| 5th row | https://www.ncbi.nlm.nih.gov/gquery?term=HQ600894 |
| Value | Count | Frequency (%) |
| https://www.ncbi.nlm.nih.gov/gquery?term=prjna521985 | 8 | 0.3% |
| https://www.ncbi.nlm.nih.gov/gquery?term=km521547 | 5 | 0.2% |
| https://www.ncbi.nlm.nih.gov/gquery?term=ay273864 | 3 | 0.1% |
| https://www.ncbi.nlm.nih.gov/gquery?term=kf989555;https://www.ncbi.nlm.nih.gov/gquery?term=kf989872;https://www.ncbi.nlm.nih.gov/gquery?term=kf989774;https://www.ncbi.nlm.nih.gov/gquery?term=kf989974;https://www.ncbi.nlm.nih.gov/gquery?term=kf989663 | 2 | 0.1% |
| https://www.ncbi.nlm.nih.gov/gquery?term=kp739770 | 2 | 0.1% |
| https://www.ncbi.nlm.nih.gov/gquery?term=ay273835 | 2 | 0.1% |
| https://www.ncbi.nlm.nih.gov/gquery?term=mh244118 | 2 | 0.1% |
| https://www.ncbi.nlm.nih.gov/gquery?term=jn837192;https://www.ncbi.nlm.nih.gov/gquery?term=jn837282;https://www.ncbi.nlm.nih.gov/gquery?term=jn837372;https://www.ncbi.nlm.nih.gov/gquery?term=jn837475 | 2 | 0.1% |
| https://www.ncbi.nlm.nih.gov/gquery?term=kc771789;https://www.ncbi.nlm.nih.gov/gquery?term=kc771632 | 1 | < 0.1% |
| https://www.ncbi.nlm.nih.gov/gquery?term=mw203870;https://www.ncbi.nlm.nih.gov/gquery?term=mw124994 | 1 | < 0.1% |
| Other values (3073) | 3073 |
Most occurring characters
| Value | Count | Frequency (%) |
| . | 26569 | 8.0% |
| t | 19917 | 6.0% |
| / | 19917 | 6.0% |
| w | 19917 | 6.0% |
| n | 19917 | 6.0% |
| i | 13278 | 4.0% |
| r | 13278 | 4.0% |
| e | 13278 | 4.0% |
| m | 13278 | 4.0% |
| h | 13278 | 4.0% |
| Other values (53) | 157627 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 205809 | |
| Other Punctuation | 63302 | 19.2% |
| Decimal Number | 40399 | 12.2% |
| Uppercase Letter | 13972 | 4.2% |
| Math Symbol | 6639 | 2.0% |
| Dash Punctuation | 132 | < 0.1% |
| Connector Punctuation | 1 | < 0.1% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| K | 2560 | |
| M | 1633 | |
| J | 1366 | |
| U | 1146 | 8.2% |
| Q | 975 | 7.0% |
| F | 824 | 5.9% |
| E | 586 | 4.2% |
| R | 568 | 4.1% |
| T | 523 | 3.7% |
| N | 470 | 3.4% |
| Other values (16) | 3321 |
Lowercase Letter
| Value | Count | Frequency (%) |
| t | 19917 | 9.7% |
| w | 19917 | 9.7% |
| n | 19917 | 9.7% |
| i | 13278 | 6.5% |
| r | 13278 | 6.5% |
| e | 13278 | 6.5% |
| m | 13278 | 6.5% |
| h | 13278 | 6.5% |
| g | 13278 | 6.5% |
| u | 6639 | 3.2% |
| Other values (9) | 59751 |
Decimal Number
| Value | Count | Frequency (%) |
| 7 | 4742 | |
| 2 | 4353 | |
| 4 | 4156 | |
| 9 | 4080 | |
| 8 | 4030 | |
| 1 | 3949 | |
| 6 | 3797 | |
| 0 | 3796 | |
| 3 | 3770 | |
| 5 | 3726 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 26569 | |
| / | 19917 | |
| ? | 6639 | 10.5% |
| : | 6639 | 10.5% |
| ; | 3538 | 5.6% |
Math Symbol
| Value | Count | Frequency (%) |
| = | 6639 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 132 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 219781 | |
| Common | 110473 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| t | 19917 | 9.1% |
| w | 19917 | 9.1% |
| n | 19917 | 9.1% |
| i | 13278 | 6.0% |
| r | 13278 | 6.0% |
| e | 13278 | 6.0% |
| m | 13278 | 6.0% |
| h | 13278 | 6.0% |
| g | 13278 | 6.0% |
| u | 6639 | 3.0% |
| Other values (35) | 73723 |
Common
| Value | Count | Frequency (%) |
| . | 26569 | |
| / | 19917 | |
| = | 6639 | 6.0% |
| ? | 6639 | 6.0% |
| : | 6639 | 6.0% |
| 7 | 4742 | 4.3% |
| 2 | 4353 | 3.9% |
| 4 | 4156 | 3.8% |
| 9 | 4080 | 3.7% |
| 8 | 4030 | 3.6% |
| Other values (8) | 22709 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 330254 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| . | 26569 | 8.0% |
| t | 19917 | 6.0% |
| / | 19917 | 6.0% |
| w | 19917 | 6.0% |
| n | 19917 | 6.0% |
| i | 13278 | 4.0% |
| r | 13278 | 4.0% |
| e | 13278 | 4.0% |
| m | 13278 | 4.0% |
| h | 13278 | 4.0% |
| Other values (53) | 157627 |
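The sample rows in this section suggest that each value holds one or more NCBI search URLs joined by `;`, with the accession number carried in the `term` query parameter. A minimal Python sketch for pulling the accessions out, assuming that layout holds for every row (the helper name `extract_accessions` is hypothetical and not part of the report):

```python
from urllib.parse import urlparse, parse_qs

def extract_accessions(value):
    """Split a ';'-joined URL string and collect the 'term' parameter of each URL."""
    accessions = []
    for url in value.split(";"):
        url = url.strip()
        if not url:
            continue  # skip empty fragments, e.g. from a trailing ';'
        # parse_qs returns a dict of lists; a missing 'term' yields no entries
        accessions.extend(parse_qs(urlparse(url).query).get("term", []))
    return accessions

# Value copied from the 4th sample row of this section
row = ("https://www.ncbi.nlm.nih.gov/gquery?term=KC771789;"
       "https://www.ncbi.nlm.nih.gov/gquery?term=KC771632")
print(extract_accessions(row))
```

Splitting on `;` before parsing each URL keeps the sketch independent of how the query-string parser treats semicolons.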
Missing 
| Distinct | 167099 |
|---|---|
| Distinct (%) | 53.2% |
| Missing | 2047572 |
| Missing (%) | 86.7% |
| Memory size | 18.0 MiB |
Length
| Max length | 197629 |
|---|---|
| Median length | 2471 |
| Mean length | 67.07117849 |
| Min length | 1 |
Unique
| Unique | 146202 |
|---|---|
| Unique (%) | 46.6% |
Sample
| 1st row | Ninoe sp. B |
|---|---|
| 2nd row | {"hostGen":"Wallago","hostSpec":"after","hostBodyLoc":"stomach"}; Original USNPC preservative was a solution of 70% ethanol, 3% formalin, and 2% glycerine |
| 3rd row | {"hostGen":"Catoptrophorus","hostSpec":"semipalmatus","hostBodyLoc":"esophagus","hostFldNo":"JEBadley-426-23"}; Glycerin jelly |
| 4th row | Scripps Institution of Oceanography library archives about M.J. Johnson Phyllosoma Collection: specimens were stained with fast green and are mounted mostly in Canada balsam, Harleco synthetic resin or diatex. |
| 5th row | 8/28/28; 6527; Orcutt; Chamberlain Coll |
| Value | Count | Frequency (%) |
| of | 64177 | 2.1% |
| by | 48564 | 1.6% |
| and | 45626 | 1.5% |
| the | 43989 | 1.4% |
| coll | 38399 | 1.3% |
| 34601 | 1.1% | |
| a | 34537 | 1.1% |
| to | 31161 | 1.0% |
| was | 27077 | 0.9% |
| in | 26228 | 0.9% |
| Other values (150526) | 2642394 |
Most occurring characters
| Value | Count | Frequency (%) |
| (space) | 2687843 | 12.8% |
| e | 1443748 | 6.9% |
| o | 1130071 | 5.4% |
| a | 1127981 | 5.4% |
| i | 1022042 | 4.9% |
| t | 975063 | 4.6% |
| n | 951404 | 4.5% |
| r | 864162 | 4.1% |
| s | 821052 | 3.9% |
| l | 811731 | 3.9% |
| Other values (154) | 9218613 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 12861737 | |
| Space Separator | 2687843 | 12.8% |
| Uppercase Letter | 2138640 | 10.2% |
| Other Punctuation | 1606079 | 7.6% |
| Decimal Number | 1358982 | 6.5% |
| Control | 113389 | 0.5% |
| Dash Punctuation | 109232 | 0.5% |
| Open Punctuation | 76596 | 0.4% |
| Close Punctuation | 76560 | 0.4% |
| Math Symbol | 14337 | 0.1% |
| Other values (10) | 10315 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 1443748 | |
| o | 1130071 | 8.8% |
| a | 1127981 | 8.8% |
| i | 1022042 | 7.9% |
| t | 975063 | 7.6% |
| n | 951404 | 7.4% |
| r | 864162 | 6.7% |
| s | 821052 | 6.4% |
| l | 811731 | 6.3% |
| d | 545995 | 4.2% |
| Other values (49) | 3168488 |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 247477 | 11.6% |
| C | 222887 | 10.4% |
| P | 130396 | 6.1% |
| N | 115110 | 5.4% |
| B | 112916 | 5.3% |
| M | 112131 | 5.2% |
| F | 107958 | 5.0% |
| T | 102498 | 4.8% |
| A | 96139 | 4.5% |
| L | 90325 | 4.2% |
| Other values (24) | 800803 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 474207 | |
| " | 320322 | |
| ; | 300035 | |
| , | 209967 | |
| : | 172539 | 10.7% |
| % | 43338 | 2.7% |
| / | 32232 | 2.0% |
| ! | 16738 | 1.0% |
| ' | 13401 | 0.8% |
| # | 11044 | 0.7% |
| Other values (8) | 12256 | 0.8% |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 265876 | |
| 2 | 181718 | |
| 0 | 152398 | |
| 9 | 146413 | |
| 3 | 119730 | |
| 7 | 107513 | |
| 5 | 101445 | 7.5% |
| 4 | 98758 | 7.3% |
| 6 | 97441 | 7.2% |
| 8 | 87690 | 6.5% |
Math Symbol
| Value | Count | Frequency (%) |
| = | 7530 | |
| + | 3523 | |
| | | 3144 | |
| ~ | 52 | 0.4% |
| > | 48 | 0.3% |
| < | 24 | 0.2% |
| × | 13 | 0.1% |
| ± | 3 | < 0.1% |
Other Symbol
| Value | Count | Frequency (%) |
| ° | 948 | |
| ♂ | 29 | 2.9% |
| ♀ | 11 | 1.1% |
| © | 6 | 0.6% |
| ⚥ | 5 | 0.5% |
Other Number
| Value | Count | Frequency (%) |
| ½ | 5 | |
| ³ | 1 | 12.5% |
| ¼ | 1 | 12.5% |
| ¹ | 1 | 12.5% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 108790 | |
| – | 435 | 0.4% |
| — | 7 | < 0.1% |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 51244 | |
| { | 22611 | |
| [ | 2741 | 3.6% |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 51215 | |
| } | 22607 | |
| ] | 2738 | 3.6% |
Final Punctuation
| Value | Count | Frequency (%) |
| ” | 118 | |
| › | 4 | 3.3% |
| » | 1 | 0.8% |
Nonspacing Mark
| Value | Count | Frequency (%) |
| ́ | 93 | |
| ̧ | 31 | 20.0% |
| ̀ | 31 | 20.0% |
Control
| Value | Count | Frequency (%) |
| 112881 | ||
| 508 | 0.4% |
Initial Punctuation
| Value | Count | Frequency (%) |
| “ | 107 | |
| « | 1 | 0.9% |
Modifier Symbol
| Value | Count | Frequency (%) |
| ^ | 5 | |
| ´ | 1 | 16.7% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 2687843 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 8532 |
Other Letter
| Value | Count | Frequency (%) |
| º | 288 |
Currency Symbol
| Value | Count | Frequency (%) |
| $ | 95 |
Format
| Value | Count | Frequency (%) |
| | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 15000628 | |
| Common | 6052895 | |
| Inherited | 155 | < 0.1% |
| Greek | 32 | < 0.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 1443748 | 9.6% |
| o | 1130071 | 7.5% |
| a | 1127981 | 7.5% |
| i | 1022042 | 6.8% |
| t | 975063 | 6.5% |
| n | 951404 | 6.3% |
| r | 864162 | 5.8% |
| s | 821052 | 5.5% |
| l | 811731 | 5.4% |
| d | 545995 | 3.6% |
| Other values (82) | 5307379 |
Common
| Value | Count | Frequency (%) |
| (space) | 2687843 | |
| . | 474207 | 7.8% |
| " | 320322 | 5.3% |
| ; | 300035 | 5.0% |
| 1 | 265876 | 4.4% |
| , | 209967 | 3.5% |
| 2 | 181718 | 3.0% |
| : | 172539 | 2.9% |
| 0 | 152398 | 2.5% |
| 9 | 146413 | 2.4% |
| Other values (58) | 1141577 |
Inherited
| Value | Count | Frequency (%) |
| ́ | 93 | |
| ̧ | 31 | 20.0% |
| ̀ | 31 | 20.0% |
Greek
| Value | Count | Frequency (%) |
| μ | 32 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 21050335 | |
| None | 2455 | < 0.1% |
| Punctuation | 720 | < 0.1% |
| Diacriticals | 155 | < 0.1% |
| Misc Symbols | 45 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| (space) | 2687843 | 12.8% |
| e | 1443748 | 6.9% |
| o | 1130071 | 5.4% |
| a | 1127981 | 5.4% |
| i | 1022042 | 4.9% |
| t | 975063 | 4.6% |
| n | 951404 | 4.5% |
| r | 864162 | 4.1% |
| s | 821052 | 3.9% |
| l | 811731 | 3.9% |
| Other values (86) | 9215238 |
None
| Value | Count | Frequency (%) |
| ° | 948 | |
| é | 319 | 13.0% |
| º | 288 | 11.7% |
| í | 224 | 9.1% |
| ñ | 95 | 3.9% |
| á | 80 | 3.3% |
| · | 67 | 2.7% |
| è | 48 | 2.0% |
| ü | 45 | 1.8% |
| ã | 44 | 1.8% |
| Other values (45) | 297 | 12.1% |
Punctuation
| Value | Count | Frequency (%) |
| – | 435 | |
| ” | 118 | 16.4% |
| “ | 107 | 14.9% |
| … | 41 | 5.7% |
| • | 8 | 1.1% |
| — | 7 | 1.0% |
| › | 4 | 0.6% |
Diacriticals
| Value | Count | Frequency (%) |
| ́ | 93 | |
| ̧ | 31 | 20.0% |
| ̀ | 31 | 20.0% |
Misc Symbols
| Value | Count | Frequency (%) |
| ♂ | 29 | |
| ♀ | 11 | 24.4% |
| ⚥ | 5 | 11.1% |
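Some sample rows in this section begin with a JSON object (for example `{"hostGen":...}`) followed by `; ` and free text. A hedged Python sketch for separating the two parts, assuming the JSON object always comes first when present (the function name `split_json_prefix` is hypothetical). The `{` and `}` counts above differ slightly (22611 vs 22607), so the sketch falls back to returning the raw text when decoding fails:

```python
import json

def split_json_prefix(remark):
    """Return (parsed_object_or_None, remaining_text) for a remark string."""
    text = remark.lstrip()
    if not text.startswith("{"):
        return None, remark
    try:
        # raw_decode parses one JSON value and reports where it ended
        obj, end = json.JSONDecoder().raw_decode(text)
    except json.JSONDecodeError:
        return None, remark  # unbalanced or malformed braces: keep the text as-is
    rest = text[end:].lstrip("; ").strip()
    return obj, rest

# Value copied from the 3rd sample row of this section
sample = ('{"hostGen":"Catoptrophorus","hostSpec":"semipalmatus",'
          '"hostBodyLoc":"esophagus","hostFldNo":"JEBadley-426-23"}; Glycerin jelly')
host, note = split_json_prefix(sample)
print(host["hostGen"], "|", note)
```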
verbatimLabel
Text
Missing 
| Distinct | 2 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 2361471 |
| Missing (%) | > 99.9% |
| Memory size | 18.0 MiB |
Length
| Max length | 45 |
|---|---|
| Median length | 26.5 |
| Mean length | 26.5 |
| Min length | 8 |
Unique
| Unique | 2 |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | -15.6527 |
|---|---|
| 2nd row | North America, Canada, Nunavut, Baffin Island |
| Value | Count | Frequency (%) |
| 15.6527 | 1 | |
| north | 1 | |
| america | 1 | |
| canada | 1 | |
| nunavut | 1 | |
| baffin | 1 | |
| island | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 7 | 13.2% |
| (space) | 5 | 9.4% |
| n | 4 | 7.5% |
| , | 3 | 5.7% |
| i | 2 | 3.8% |
| d | 2 | 3.8% |
| u | 2 | 3.8% |
| t | 2 | 3.8% |
| r | 2 | 3.8% |
| N | 2 | 3.8% |
| Other values (20) | 22 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 31 | |
| Uppercase Letter | 6 | 11.3% |
| Decimal Number | 6 | 11.3% |
| Space Separator | 5 | 9.4% |
| Other Punctuation | 4 | 7.5% |
| Dash Punctuation | 1 | 1.9% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 7 | |
| n | 4 | |
| i | 2 | 6.5% |
| d | 2 | 6.5% |
| u | 2 | 6.5% |
| t | 2 | 6.5% |
| r | 2 | 6.5% |
| f | 2 | 6.5% |
| v | 1 | 3.2% |
| s | 1 | 3.2% |
| Other values (6) | 6 |
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 2 | |
| B | 1 | |
| I | 1 | |
| C | 1 | |
| A | 1 |
Decimal Number
| Value | Count | Frequency (%) |
| 5 | 2 | |
| 1 | 1 | |
| 7 | 1 | |
| 2 | 1 | |
| 6 | 1 |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 3 | |
| . | 1 | 25.0% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 5 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 37 | |
| Common | 16 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 7 | |
| n | 4 | 10.8% |
| i | 2 | 5.4% |
| d | 2 | 5.4% |
| u | 2 | 5.4% |
| t | 2 | 5.4% |
| r | 2 | 5.4% |
| N | 2 | 5.4% |
| f | 2 | 5.4% |
| v | 1 | 2.7% |
| Other values (11) | 11 |
Common
| Value | Count | Frequency (%) |
| (space) | 5 | |
| , | 3 | |
| 5 | 2 | 12.5% |
| - | 1 | 6.2% |
| 1 | 1 | 6.2% |
| 7 | 1 | 6.2% |
| 2 | 1 | 6.2% |
| 6 | 1 | 6.2% |
| . | 1 | 6.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 53 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 7 | 13.2% |
| (space) | 5 | 9.4% |
| n | 4 | 7.5% |
| , | 3 | 5.7% |
| i | 2 | 3.8% |
| d | 2 | 3.8% |
| u | 2 | 3.8% |
| t | 2 | 3.8% |
| r | 2 | 3.8% |
| N | 2 | 3.8% |
| Other values (20) | 22 |
materialSampleID
Text
Missing 
| Distinct | 2 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 2361471 |
| Missing (%) | > 99.9% |
| Memory size | 18.0 MiB |
Length
| Max length | 13 |
|---|---|
| Median length | 10 |
| Mean length | 10 |
| Min length | 7 |
Unique
| Unique | 2 |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | 135.777 |
|---|---|
| 2nd row | NORTH_AMERICA |
| Value | Count | Frequency (%) |
| 135.777 | 1 | |
| north_america | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| 7 | 3 | |
| R | 2 | 10.0% |
| A | 2 | 10.0% |
| 1 | 1 | 5.0% |
| 3 | 1 | 5.0% |
| 5 | 1 | 5.0% |
| . | 1 | 5.0% |
| N | 1 | 5.0% |
| O | 1 | 5.0% |
| T | 1 | 5.0% |
| Other values (6) | 6 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 12 | |
| Decimal Number | 6 | |
| Other Punctuation | 1 | 5.0% |
| Connector Punctuation | 1 | 5.0% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| R | 2 | |
| A | 2 | |
| N | 1 | |
| O | 1 | |
| T | 1 | |
| H | 1 | |
| M | 1 | |
| E | 1 | |
| I | 1 | |
| C | 1 |
Decimal Number
| Value | Count | Frequency (%) |
| 7 | 3 | |
| 1 | 1 | 16.7% |
| 3 | 1 | 16.7% |
| 5 | 1 | 16.7% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 1 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 12 | |
| Common | 8 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| R | 2 | |
| A | 2 | |
| N | 1 | |
| O | 1 | |
| T | 1 | |
| H | 1 | |
| M | 1 | |
| E | 1 | |
| I | 1 | |
| C | 1 |
Common
| Value | Count | Frequency (%) |
| 7 | 3 | |
| 1 | 1 | 12.5% |
| 3 | 1 | 12.5% |
| 5 | 1 | 12.5% |
| . | 1 | 12.5% |
| _ | 1 | 12.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 20 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 7 | 3 | |
| R | 2 | 10.0% |
| A | 2 | 10.0% |
| 1 | 1 | 5.0% |
| 3 | 1 | 5.0% |
| 5 | 1 | 5.0% |
| . | 1 | 5.0% |
| N | 1 | 5.0% |
| O | 1 | 5.0% |
| T | 1 | 5.0% |
| Other values (6) | 6 |
eventType
Text
Constant  Missing 
| Distinct | 1 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 2361472 |
| Missing (%) | > 99.9% |
| Memory size | 18.0 MiB |
Length
| Max length | 13 |
|---|---|
| Median length | 13 |
| Mean length | 13 |
| Min length | 13 |
Unique
| Unique | 1 |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | Baffin Island |
|---|---|
| Value | Count | Frequency (%) |
| baffin | 1 | |
| island | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 2 | |
| f | 2 | |
| n | 2 | |
| B | 1 | |
| i | 1 | |
| (space) | 1 | |
| I | 1 | |
| s | 1 | |
| l | 1 | |
| d | 1 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 10 | |
| Uppercase Letter | 2 | 15.4% |
| Space Separator | 1 | 7.7% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 2 | |
| f | 2 | |
| n | 2 | |
| i | 1 | |
| s | 1 | |
| l | 1 | |
| d | 1 |
Uppercase Letter
| Value | Count | Frequency (%) |
| B | 1 | |
| I | 1 |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 12 | |
| Common | 1 | 7.7% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 2 | |
| f | 2 | |
| n | 2 | |
| B | 1 | |
| i | 1 | |
| I | 1 | |
| s | 1 | |
| l | 1 | |
| d | 1 |
Common
| Value | Count | Frequency (%) |
| (space) | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 13 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 2 | |
| f | 2 | |
| n | 2 | |
| B | 1 | |
| i | 1 | |
| (space) | 1 | |
| I | 1 | |
| s | 1 | |
| l | 1 | |
| d | 1 |
fieldNumber
Text
Missing 
| Distinct | 48666 |
|---|---|
| Distinct (%) | 24.7% |
| Missing | 2164715 |
| Missing (%) | 91.7% |
| Memory size | 18.0 MiB |
Length
| Max length | 97 |
|---|---|
| Median length | 64 |
| Mean length | 12.74823895 |
| Min length | 1 |
Unique
| Unique | 22801 |
|---|---|
| Unique (%) | 11.6% |
Sample
| 1st row | MMS-MAMES/B3:M4-4 |
|---|---|
| 2nd row | USARP/EL/9/740/USC |
| 3rd row | M165503; H.29-118 |
| 4th row | USFC/A5151 |
| 5th row | USARP/EL/6/369/USC |
| Value | Count | Frequency (%) |
| vgs | 4890 | 1.9% |
| mms-mafla/jar | 4303 | 1.7% |
| jtw | 3701 | 1.4% |
| bolland/rfb | 1880 | 0.7% |
| bbc | 1566 | 0.6% |
| humes | 1397 | 0.5% |
| 1387 | 0.5% | |
| jpem | 1304 | 0.5% |
| lwk | 1042 | 0.4% |
| lk | 1037 | 0.4% |
| Other values (46561) | 233531 |
Most occurring characters
| Value | Count | Frequency (%) |
| / | 189196 | 7.5% |
| S | 180164 | 7.2% |
| - | 171500 | 6.8% |
| 1 | 135819 | 5.4% |
| M | 134244 | 5.4% |
| 0 | 125696 | 5.0% |
| A | 119355 | 4.8% |
| 2 | 118372 | 4.7% |
| C | 100849 | 4.0% |
| 3 | 83038 | 3.3% |
| Other values (73) | 1150085 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 1136258 | |
| Decimal Number | 869208 | |
| Other Punctuation | 223269 | 8.9% |
| Dash Punctuation | 171500 | 6.8% |
| Space Separator | 59280 | 2.4% |
| Lowercase Letter | 45487 | 1.8% |
| Connector Punctuation | 1901 | 0.1% |
| Open Punctuation | 657 | < 0.1% |
| Close Punctuation | 657 | < 0.1% |
| Math Symbol | 100 | < 0.1% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 180164 | |
| M | 134244 | |
| A | 119355 | |
| C | 100849 | 8.9% |
| U | 72371 | 6.4% |
| F | 63441 | 5.6% |
| L | 52209 | 4.6% |
| I | 51174 | 4.5% |
| R | 50110 | 4.4% |
| B | 48743 | 4.3% |
| Other values (16) | 263598 |
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 7455 | |
| r | 6894 | |
| a | 6685 | |
| o | 3555 | |
| l | 2812 | 6.2% |
| i | 2524 | 5.5% |
| u | 2399 | 5.3% |
| s | 2313 | 5.1% |
| t | 2095 | 4.6% |
| m | 1992 | 4.4% |
| Other values (16) | 6763 |
Other Punctuation
| Value | Count | Frequency (%) |
| / | 189196 | |
| : | 20778 | 9.3% |
| ; | 9334 | 4.2% |
| . | 2271 | 1.0% |
| , | 940 | 0.4% |
| # | 553 | 0.2% |
| \ | 93 | < 0.1% |
| ? | 36 | < 0.1% |
| & | 34 | < 0.1% |
| ' | 18 | < 0.1% |
| Other values (3) | 16 | < 0.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 135819 | |
| 0 | 125696 | |
| 2 | 118372 | |
| 3 | 83038 | |
| 5 | 82324 | |
| 4 | 72440 | |
| 7 | 68844 | |
| 6 | 67012 | |
| 8 | 59728 | |
| 9 | 55935 |
Math Symbol
| Value | Count | Frequency (%) |
| + | 98 | |
| = | 2 | 2.0% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 171500 |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 59280 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 1901 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 657 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 657 |
Final Punctuation
| Value | Count | Frequency (%) |
| › | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 1326573 | |
| Latin | 1181745 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| S | 180164 | |
| M | 134244 | |
| A | 119355 | 10.1% |
| C | 100849 | 8.5% |
| U | 72371 | 6.1% |
| F | 63441 | 5.4% |
| L | 52209 | 4.4% |
| I | 51174 | 4.3% |
| R | 50110 | 4.2% |
| B | 48743 | 4.1% |
| Other values (42) | 309085 |
Common
| Value | Count | Frequency (%) |
| / | 189196 | |
| - | 171500 | |
| 1 | 135819 | |
| 0 | 125696 | |
| 2 | 118372 | |
| 3 | 83038 | 6.3% |
| 5 | 82324 | 6.2% |
| 4 | 72440 | 5.5% |
| 7 | 68844 | 5.2% |
| 6 | 67012 | 5.1% |
| Other values (21) | 212332 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2508317 | |
| Punctuation | 1 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| / | 189196 | 7.5% |
| S | 180164 | 7.2% |
| - | 171500 | 6.8% |
| 1 | 135819 | 5.4% |
| M | 134244 | 5.4% |
| 0 | 125696 | 5.0% |
| A | 119355 | 4.8% |
| 2 | 118372 | 4.7% |
| C | 100849 | 4.0% |
| 3 | 83038 | 3.3% |
| Other values (72) | 1150084 |
Punctuation
| Value | Count | Frequency (%) |
| › | 1 |
eventDate
Text
Missing 
| Distinct | 79092 |
|---|---|
| Distinct (%) | 4.1% |
| Missing | 419648 |
| Missing (%) | 17.8% |
| Memory size | 18.0 MiB |
Length
| Max length | 21 |
|---|---|
| Median length | 10 |
| Mean length | 9.989250319 |
| Min length | 4 |
Unique
| Unique | 14082 |
|---|---|
| Unique (%) | 0.7% |
Sample
| 1st row | 1981-04-24 |
|---|---|
| 2nd row | 1952-03-30 |
| 3rd row | 1958-08-06 |
| 4th row | 1900-11 |
| 5th row | 1988-08-20 |
| Value | Count | Frequency (%) |
| 1915 | 2128 | 0.1% |
| 1913 | 1918 | 0.1% |
| 1916 | 1707 | 0.1% |
| 1891 | 1468 | 0.1% |
| 1982-07-21 | 1436 | 0.1% |
| 1981-07-06 | 1349 | 0.1% |
| 1923 | 1342 | 0.1% |
| 1982-11-19 | 1332 | 0.1% |
| 1880 | 1329 | 0.1% |
| 1929 | 1317 | 0.1% |
| Other values (79082) | 1926499 |
Most occurring characters
| Value | Count | Frequency (%) |
| - | 3722241 | |
| 1 | 3671544 | |
| 0 | 2955423 | |
| 9 | 2451007 | |
| 2 | 1415307 | 7.3% |
| 8 | 1113644 | 5.7% |
| 7 | 900586 | 4.6% |
| 6 | 896747 | 4.6% |
| 3 | 759476 | 3.9% |
| 5 | 731380 | 3.8% |
| Other values (8) | 780021 | 4.0% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 15582458 | |
| Dash Punctuation | 3722241 | 19.2% |
| Other Punctuation | 92670 | 0.5% |
| Lowercase Letter | 6 | < 0.1% |
| Uppercase Letter | 1 | < 0.1% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 3671544 | |
| 0 | 2955423 | |
| 9 | 2451007 | |
| 2 | 1415307 | 9.1% |
| 8 | 1113644 | 7.1% |
| 7 | 900586 | 5.8% |
| 6 | 896747 | 5.8% |
| 3 | 759476 | 4.9% |
| 5 | 731380 | 4.7% |
| 4 | 687344 | 4.4% |
Lowercase Letter
| Value | Count | Frequency (%) |
| u | 2 | |
| n | 1 | |
| a | 1 | |
| v | 1 | |
| t | 1 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 3722241 |
Other Punctuation
| Value | Count | Frequency (%) |
| / | 92670 |
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 19397369 | |
| Latin | 7 | < 0.1% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| - | 3722241 | |
| 1 | 3671544 | |
| 0 | 2955423 | |
| 9 | 2451007 | |
| 2 | 1415307 | 7.3% |
| 8 | 1113644 | 5.7% |
| 7 | 900586 | 4.6% |
| 6 | 896747 | 4.6% |
| 3 | 759476 | 3.9% |
| 5 | 731380 | 3.8% |
| Other values (2) | 780014 | 4.0% |
Latin
| Value | Count | Frequency (%) |
| u | 2 | |
| N | 1 | |
| n | 1 | |
| a | 1 | |
| v | 1 | |
| t | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 19397376 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| - | 3722241 | |
| 1 | 3671544 | |
| 0 | 2955423 | |
| 9 | 2451007 | |
| 2 | 1415307 | 7.3% |
| 8 | 1113644 | 5.7% |
| 7 | 900586 | 4.6% |
| 6 | 896747 | 4.6% |
| 3 | 759476 | 3.9% |
| 5 | 731380 | 3.8% |
| Other values (8) | 780021 | 4.0% |
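The eventDate samples above mix several precisions (`1981-04-24`, `1900-11`, `1915`). The 92670 `/` characters and the maximum length of 21 are consistent with date intervals such as `YYYY-MM-DD/YYYY-MM-DD`, although no interval appears in the sample rows, so that part is an assumption. A minimal Python sketch for tallying values by precision (the labels and the `date_precision` helper are hypothetical):

```python
import re
from collections import Counter

# Checked in order; "range" is an assumed format, not shown in the sample rows
PATTERNS = [
    ("range", re.compile(r"^[^/]+/[^/]+$")),
    ("day", re.compile(r"^\d{4}-\d{2}-\d{2}$")),
    ("month", re.compile(r"^\d{4}-\d{2}$")),
    ("year", re.compile(r"^\d{4}$")),
]

def date_precision(value):
    """Label a date string by its apparent precision, or 'other' if nothing matches."""
    for label, pattern in PATTERNS:
        if pattern.match(value):
            return label
    return "other"

values = ["1981-04-24", "1952-03-30", "1900-11", "1915", "1982-07-21"]
print(Counter(date_precision(v) for v in values))
```

Values labeled `other` would be candidates for manual review, such as the few entries containing letters reported in the character tables above.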
startDayOfYear
Text
Missing 
| Distinct | 366 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 669491 |
| Missing (%) | 28.4% |
| Memory size | 18.0 MiB |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 2.764176569 |
| Min length | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 114 |
|---|---|
| 2nd row | 90 |
| 3rd row | 218 |
| 4th row | 233 |
| 5th row | 189 |
| Value | Count | Frequency (%) |
| 202 | 8899 | 0.5% |
| 196 | 7844 | 0.5% |
| 199 | 7829 | 0.5% |
| 206 | 7783 | 0.5% |
| 210 | 7720 | 0.5% |
| 187 | 7619 | 0.5% |
| 201 | 7549 | 0.4% |
| 200 | 7529 | 0.4% |
| 219 | 7370 | 0.4% |
| 197 | 7339 | 0.4% |
| Other values (356) | 1614501 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 936211 | |
| 2 | 897252 | |
| 3 | 558831 | |
| 4 | 348360 | 7.4% |
| 5 | 343495 | 7.3% |
| 6 | 328341 | 7.0% |
| 0 | 325792 | 7.0% |
| 9 | 319075 | 6.8% |
| 7 | 311518 | 6.7% |
| 8 | 308062 | 6.6% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 4676937 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 936211 | |
| 2 | 897252 | |
| 3 | 558831 | |
| 4 | 348360 | 7.4% |
| 5 | 343495 | 7.3% |
| 6 | 328341 | 7.0% |
| 0 | 325792 | 7.0% |
| 9 | 319075 | 6.8% |
| 7 | 311518 | 6.7% |
| 8 | 308062 | 6.6% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 4676937 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 936211 | |
| 2 | 897252 | |
| 3 | 558831 | |
| 4 | 348360 | 7.4% |
| 5 | 343495 | 7.3% |
| 6 | 328341 | 7.0% |
| 0 | 325792 | 7.0% |
| 9 | 319075 | 6.8% |
| 7 | 311518 | 6.7% |
| 8 | 308062 | 6.6% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 4676937 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 936211 | |
| 2 | 897252 | |
| 3 | 558831 | |
| 4 | 348360 | 7.4% |
| 5 | 343495 | 7.3% |
| 6 | 328341 | 7.0% |
| 0 | 325792 | 7.0% |
| 9 | 319075 | 6.8% |
| 7 | 311518 | 6.7% |
| 8 | 308062 | 6.6% |
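The first three startDayOfYear samples (114, 90, 218) line up with the first three eventDate samples (1981-04-24, 1952-03-30, 1958-08-06), and the 366 distinct values are consistent with leap years being present. A small Python sketch of that relationship, assuming the field is the ordinal day of the event date (the helper name `day_of_year` is hypothetical):

```python
from datetime import date

def day_of_year(iso_date):
    """Return the 1-based day of year for a full YYYY-MM-DD string."""
    return date.fromisoformat(iso_date).timetuple().tm_yday

print(day_of_year("1981-04-24"))  # 114
print(day_of_year("1952-03-30"))  # 90 (1952 is a leap year)
print(day_of_year("1958-08-06"))  # 218
```

Dates with only year or month precision carry no day of year, which may explain why more values are missing here (28.4%) than in eventDate (17.8%).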
endDayOfYear
Text
Missing 
| Distinct | 367 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 669490 |
| Missing (%) | 28.4% |
| Memory size | 18.0 MiB |
Length
| Max length | 69 |
|---|---|
| Median length | 3 |
| Mean length | 2.765497053 |
| Min length | 1 |
Unique
| Unique | 1 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | 114 |
|---|---|
| 2nd row | 90 |
| 3rd row | 218 |
| 4th row | 233 |
| 5th row | 189 |
| Value | Count | Frequency (%) |
| 202 | 8881 | 0.5% |
| 210 | 7894 | 0.5% |
| 200 | 7857 | 0.5% |
| 196 | 7822 | 0.5% |
| 191 | 7738 | 0.5% |
| 199 | 7653 | 0.5% |
| 206 | 7645 | 0.5% |
| 197 | 7551 | 0.4% |
| 187 | 7541 | 0.4% |
| 201 | 7478 | 0.4% |
| Other values (364) | 1613930 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 936531 | |
| 2 | 898023 | |
| 3 | 560326 | |
| 4 | 349297 | 7.5% |
| 5 | 343958 | 7.4% |
| 0 | 326366 | 7.0% |
| 6 | 325151 | 6.9% |
| 9 | 317474 | 6.8% |
| 7 | 312820 | 6.7% |
| 8 | 309159 | 6.6% |
| Other values (26) | 69 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 4679105 | |
| Lowercase Letter | 52 | < 0.1% |
| Space Separator | 7 | < 0.1% |
| Uppercase Letter | 7 | < 0.1% |
| Other Punctuation | 1 | < 0.1% |
| Open Punctuation | 1 | < 0.1% |
| Close Punctuation | 1 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| r | 11 | |
| i | 6 | |
| t | 5 | |
| e | 4 | 7.7% |
| o | 4 | 7.7% |
| n | 4 | 7.7% |
| a | 3 | 5.8% |
| s | 3 | 5.8% |
| b | 2 | 3.8% |
| l | 2 | 3.8% |
| Other values (7) | 8 |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 936531 | |
| 2 | 898023 | |
| 3 | 560326 | |
| 4 | 349297 | 7.5% |
| 5 | 343958 | 7.4% |
| 0 | 326366 | 7.0% |
| 6 | 325151 | 6.9% |
| 9 | 317474 | 6.8% |
| 7 | 312820 | 6.7% |
| 8 | 309159 | 6.6% |
Uppercase Letter
| Value | Count | Frequency (%) |
| F | 2 | |
| D | 2 | |
| T | 1 | |
| N | 1 | |
| H | 1 |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 7 |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 1 |
Open Punctuation
| Value | Count | Frequency (%) |
| [ | 1 |
Close Punctuation
| Value | Count | Frequency (%) |
| ] | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 4679115 | |
| Latin | 59 | < 0.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| r | 11 | |
| i | 6 | 10.2% |
| t | 5 | 8.5% |
| e | 4 | 6.8% |
| o | 4 | 6.8% |
| n | 4 | 6.8% |
| a | 3 | 5.1% |
| s | 3 | 5.1% |
| F | 2 | 3.4% |
| b | 2 | 3.4% |
| Other values (12) | 15 |
Common
| Value | Count | Frequency (%) |
| 1 | 936531 | |
| 2 | 898023 | |
| 3 | 560326 | |
| 4 | 349297 | 7.5% |
| 5 | 343958 | 7.4% |
| 0 | 326366 | 7.0% |
| 6 | 325151 | 6.9% |
| 9 | 317474 | 6.8% |
| 7 | 312820 | 6.7% |
| 8 | 309159 | 6.6% |
| Other values (4) | 10 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 4679174 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 936531 | |
| 2 | 898023 | |
| 3 | 560326 | |
| 4 | 349297 | 7.5% |
| 5 | 343958 | 7.4% |
| 0 | 326366 | 7.0% |
| 6 | 325151 | 6.9% |
| 9 | 317474 | 6.8% |
| 7 | 312820 | 6.7% |
| 8 | 309159 | 6.6% |
| Other values (26) | 69 | < 0.1% |
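Unlike startDayOfYear, this column has a maximum length of 69 and a handful of letters, spaces, and brackets, which suggests at least one free-text value has landed in a numeric field. A minimal Python sketch for flagging such values, assuming valid entries are integers from 1 to 366 (the helper name `is_valid_day_of_year` is hypothetical):

```python
def is_valid_day_of_year(value):
    """True if value is an ASCII integer string between 1 and 366 inclusive."""
    text = value.strip()
    # isascii() guards against digit-like characters such as superscripts
    return text.isascii() and text.isdigit() and 1 <= int(text) <= 366

candidates = ["202", "90", "0", "367", "not a number"]
print([v for v in candidates if not is_valid_day_of_year(v)])
```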
year
Text
Missing 
| Distinct | 301 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 423106 |
| Missing (%) | 17.9% |
| Memory size | 18.0 MiB |
Length
| Max length | 4 |
|---|---|
| Median length | 4 |
| Mean length | 4 |
| Min length | 4 |
Unique
| Unique | 47 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | 1981 |
|---|---|
| 2nd row | 1952 |
| 3rd row | 1958 |
| 4th row | 1900 |
| 5th row | 1988 |
| Value | Count | Frequency (%) |
| 1966 | 36263 | 1.9% |
| 1967 | 33378 | 1.7% |
| 1964 | 33172 | 1.7% |
| 1977 | 31555 | 1.6% |
| 1968 | 31427 | 1.6% |
| 1965 | 29410 | 1.5% |
| 1969 | 28034 | 1.4% |
| 1963 | 25250 | 1.3% |
| 1970 | 25083 | 1.3% |
| 1971 | 24744 | 1.3% |
| Other values (291) | 1640051 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 2184214 | |
| 9 | 2015776 | |
| 8 | 680289 | 8.8% |
| 0 | 504133 | 6.5% |
| 6 | 492091 | 6.3% |
| 7 | 448375 | 5.8% |
| 2 | 417461 | 5.4% |
| 5 | 340164 | 4.4% |
| 4 | 339115 | 4.4% |
| 3 | 331850 | 4.3% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 7753468 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 2184214 | |
| 9 | 2015776 | |
| 8 | 680289 | 8.8% |
| 0 | 504133 | 6.5% |
| 6 | 492091 | 6.3% |
| 7 | 448375 | 5.8% |
| 2 | 417461 | 5.4% |
| 5 | 340164 | 4.4% |
| 4 | 339115 | 4.4% |
| 3 | 331850 | 4.3% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 7753468 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 2184214 | |
| 9 | 2015776 | |
| 8 | 680289 | 8.8% |
| 0 | 504133 | 6.5% |
| 6 | 492091 | 6.3% |
| 7 | 448375 | 5.8% |
| 2 | 417461 | 5.4% |
| 5 | 340164 | 4.4% |
| 4 | 339115 | 4.4% |
| 3 | 331850 | 4.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 7753468 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 2184214 | |
| 9 | 2015776 | |
| 8 | 680289 | 8.8% |
| 0 | 504133 | 6.5% |
| 6 | 492091 | 6.3% |
| 7 | 448375 | 5.8% |
| 2 | 417461 | 5.4% |
| 5 | 340164 | 4.4% |
| 4 | 339115 | 4.4% |
| 3 | 331850 | 4.3% |
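The summary figures for `year` can be cross-checked with simple arithmetic. The sketch below assumes the 2361473 observations stated in the report's dataset statistics; the helper names are illustrative and are not part of the profiling tool.

```python
# Cross-check of the `year` summary, using only figures printed in this report.
N_OBSERVATIONS = 2_361_473   # "Number of observations" in the dataset statistics
YEAR_MISSING = 423_106       # "Missing" for year
YEAR_LENGTH = 4              # min, median, mean and max length are all 4


def missing_pct(missing: int, total: int = N_OBSERVATIONS) -> float:
    """Share of missing cells, in percent, rounded to one decimal place."""
    return round(100 * missing / total, 1)


def total_characters(missing: int, length: int, total: int = N_OBSERVATIONS) -> int:
    """Characters across all non-missing values of a fixed-length column."""
    return (total - missing) * length


year_missing_pct = missing_pct(YEAR_MISSING)
year_characters = total_characters(YEAR_MISSING, YEAR_LENGTH)
print(year_missing_pct, year_characters)
```

Both results agree with the tables above: 17.9% missing, and 7753468 characters in total, all of them decimal digits.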
month
Text
Missing 
| Distinct | 12 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 542654 |
| Missing (%) | 23.0% |
| Memory size | 18.0 MiB |
Length
| Max length | 2 |
|---|---|
| Median length | 1 |
| Mean length | 1.174502795 |
| Min length | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 4 |
|---|---|
| 2nd row | 3 |
| 3rd row | 8 |
| 4th row | 11 |
| 5th row | 8 |
| Value | Count | Frequency (%) |
| 7 | 238033 | |
| 8 | 219764 | |
| 6 | 196912 | |
| 5 | 185254 | |
| 4 | 151950 | |
| 9 | 150297 | |
| 3 | 138712 | |
| 10 | 121082 | |
| 2 | 119259 | |
| 11 | 110527 | |
| Other values (2) | 187029 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 529165 | |
| 7 | 238033 | |
| 8 | 219764 | |
| 2 | 205039 | 9.6% |
| 6 | 196912 | 9.2% |
| 5 | 185254 | 8.7% |
| 4 | 151950 | 7.1% |
| 9 | 150297 | 7.0% |
| 3 | 138712 | 6.5% |
| 0 | 121082 | 5.7% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 2136208 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 529165 | |
| 7 | 238033 | |
| 8 | 219764 | |
| 2 | 205039 | 9.6% |
| 6 | 196912 | 9.2% |
| 5 | 185254 | 8.7% |
| 4 | 151950 | 7.1% |
| 9 | 150297 | 7.0% |
| 3 | 138712 | 6.5% |
| 0 | 121082 | 5.7% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 2136208 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 529165 | |
| 7 | 238033 | |
| 8 | 219764 | |
| 2 | 205039 | 9.6% |
| 6 | 196912 | 9.2% |
| 5 | 185254 | 8.7% |
| 4 | 151950 | 7.1% |
| 9 | 150297 | 7.0% |
| 3 | 138712 | 6.5% |
| 0 | 121082 | 5.7% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2136208 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 529165 | |
| 7 | 238033 | |
| 8 | 219764 | |
| 2 | 205039 | 9.6% |
| 6 | 196912 | 9.2% |
| 5 | 185254 | 8.7% |
| 4 | 151950 | 7.1% |
| 9 | 150297 | 7.0% |
| 3 | 138712 | 6.5% |
| 0 | 121082 | 5.7% |
day
Text
Missing 
| Distinct | 32 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 762160 |
| Missing (%) | 32.3% |
| Memory size | 18.0 MiB |
Length
| Max length | 3 |
|---|---|
| Median length | 2 |
| Mean length | 1.709295179 |
| Min length | 1 |
Unique
| Unique | 1 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | 24 |
|---|---|
| 2nd row | 30 |
| 3rd row | 6 |
| 4th row | 20 |
| 5th row | 8 |
| Value | Count | Frequency (%) |
| 15 | 57416 | 3.6% |
| 10 | 57058 | 3.6% |
| 20 | 56592 | 3.5% |
| 18 | 55176 | 3.4% |
| 19 | 55147 | 3.4% |
| 13 | 54553 | 3.4% |
| 21 | 54123 | 3.4% |
| 8 | 53877 | 3.4% |
| 16 | 53195 | 3.3% |
| 6 | 52866 | 3.3% |
| Other values (22) | 1049310 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 725188 | |
| 2 | 674547 | |
| 3 | 231511 | 8.5% |
| 5 | 161969 | 5.9% |
| 0 | 160494 | 5.9% |
| 8 | 159897 | 5.8% |
| 6 | 156693 | 5.7% |
| 4 | 155717 | 5.7% |
| 7 | 154240 | 5.6% |
| 9 | 153439 | 5.6% |
| Other values (3) | 3 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 2733695 | |
| Uppercase Letter | 3 | < 0.1% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 725188 | |
| 2 | 674547 | |
| 3 | 231511 | 8.5% |
| 5 | 161969 | 5.9% |
| 0 | 160494 | 5.9% |
| 8 | 159897 | 5.8% |
| 6 | 156693 | 5.7% |
| 4 | 155717 | 5.7% |
| 7 | 154240 | 5.6% |
| 9 | 153439 | 5.6% |
Uppercase Letter
| Value | Count | Frequency (%) |
| G | 1 | |
| P | 1 | |
| S | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 2733695 | |
| Latin | 3 | < 0.1% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 725188 | |
| 2 | 674547 | |
| 3 | 231511 | 8.5% |
| 5 | 161969 | 5.9% |
| 0 | 160494 | 5.9% |
| 8 | 159897 | 5.8% |
| 6 | 156693 | 5.7% |
| 4 | 155717 | 5.7% |
| 7 | 154240 | 5.6% |
| 9 | 153439 | 5.6% |
Latin
| Value | Count | Frequency (%) |
| G | 1 | |
| P | 1 | |
| S | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2733698 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 725188 | |
| 2 | 674547 | |
| 3 | 231511 | 8.5% |
| 5 | 161969 | 5.9% |
| 0 | 160494 | 5.9% |
| 8 | 159897 | 5.8% |
| 6 | 156693 | 5.7% |
| 4 | 155717 | 5.7% |
| 7 | 154240 | 5.6% |
| 9 | 153439 | 5.6% |
| Other values (3) | 3 | < 0.1% |
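`day` is profiled as text, has a maximum length of 3, and contains three uppercase letters, so at least one value is not a day of the month. A minimal sketch of a validity check follows. The function name and the sample values (including `"ABC"`) are made up for illustration and do not come from the dataset.

```python
def invalid_days(values):
    """Return the non-missing values that are not an integer day of month (1-31)."""
    bad = []
    for value in values:
        if value is None or value == "":
            continue  # missing cells are reported separately, not as invalid
        is_plain_number = value.isascii() and value.isdigit()
        if not (is_plain_number and 1 <= int(value) <= 31):
            bad.append(value)
    return bad


# Hypothetical sample: four valid days, one missing cell, three invalid values.
sample = ["24", "30", "6", "ABC", None, "0", "32", "8"]
flagged = invalid_days(sample)
print(flagged)
```

Running a check like this on the real column would show which records carry the non-numeric entry, without guessing its content from the character table alone.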
Missing 
| Distinct | 182342 |
|---|---|
| Distinct (%) | 16.5% |
| Missing | 1255739 |
| Missing (%) | 53.2% |
| Memory size | 18.0 MiB |
Length
| Max length | 194 |
|---|---|
| Median length | 11 |
| Mean length | 13.22736662 |
| Min length | 1 |
Unique
| Unique | 75346 |
|---|---|
| Unique (%) | 6.8% |
Sample
| 1st row | 24 APR 1981 |
|---|---|
| 2nd row | 6 Aug 1958 |
| 3rd row | 24 Jun 1934 |
| 4th row | 24 Mar 1974 |
| 5th row | 23-29 January 1885 |
| Value | Count | Frequency (%) |
| (blank) | 436209 | 11.7% |
| 00 | 204346 | 5.5% |
| 0000 | 95956 | 2.6% |
| aug | 94310 | 2.5% |
| may | 93493 | 2.5% |
| jul | 93386 | 2.5% |
| jun | 83860 | 2.3% |
| apr | 78240 | 2.1% |
| mar | 71932 | 1.9% |
| sep | 67673 | 1.8% |
| Other values (47806) | 2396571 |
Most occurring characters
| Value | Count | Frequency (%) |
| (space) | 2610242 | 17.8% |
| 1 | 1589790 | 10.9% |
| 0 | 1327954 | 9.1% |
| 9 | 1155041 | 7.9% |
| - | 1063519 | 7.3% |
| 2 | 604441 | 4.1% |
| 8 | 461134 | 3.2% |
| 6 | 404101 | 2.8% |
| 7 | 366280 | 2.5% |
| 3 | 329735 | 2.3% |
| Other values (87) | 4713712 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 6814007 | |
| Space Separator | 2610242 | 17.8% |
| Lowercase Letter | 2418926 | 16.5% |
| Uppercase Letter | 1341399 | 9.2% |
| Dash Punctuation | 1063526 | 7.3% |
| Other Punctuation | 358825 | 2.5% |
| Open Punctuation | 9406 | 0.1% |
| Close Punctuation | 9404 | 0.1% |
| Connector Punctuation | 110 | < 0.1% |
| Math Symbol | 100 | < 0.1% |
| Other values (2) | 4 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| u | 261683 | |
| a | 255572 | |
| r | 254995 | |
| e | 237935 | 9.8% |
| n | 178794 | 7.4% |
| c | 139538 | 5.8% |
| p | 138255 | 5.7% |
| y | 133747 | 5.5% |
| t | 118182 | 4.9% |
| b | 104610 | 4.3% |
| Other values (21) | 595615 |
Uppercase Letter
| Value | Count | Frequency (%) |
| J | 251561 | |
| A | 224498 | |
| M | 175768 | |
| N | 91005 | 6.8% |
| S | 85919 | 6.4% |
| O | 77756 | 5.8% |
| F | 69162 | 5.2% |
| T | 53522 | 4.0% |
| U | 46648 | 3.5% |
| D | 46534 | 3.5% |
| Other values (15) | 219026 |
Other Punctuation
| Value | Count | Frequency (%) |
| / | 180487 | |
| : | 100182 | |
| ; | 32334 | 9.0% |
| . | 27778 | 7.7% |
| , | 15385 | 4.3% |
| ' | 1482 | 0.4% |
| * | 612 | 0.2% |
| ? | 279 | 0.1% |
| ! | 136 | < 0.1% |
| & | 104 | < 0.1% |
| Other values (3) | 46 | < 0.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 1589790 | |
| 0 | 1327954 | |
| 9 | 1155041 | |
| 2 | 604441 | 8.9% |
| 8 | 461134 | 6.8% |
| 6 | 404101 | 5.9% |
| 7 | 366280 | 5.4% |
| 3 | 329735 | 4.8% |
| 5 | 296345 | 4.3% |
| 4 | 279186 | 4.1% |
Math Symbol
| Value | Count | Frequency (%) |
| \| | 48 | |
| + | 42 | |
| = | 6 | 6.0% |
| ~ | 2 | 2.0% |
| < | 1 | 1.0% |
| ± | 1 | 1.0% |
Open Punctuation
| Value | Count | Frequency (%) |
| [ | 8644 | |
| ( | 759 | 8.1% |
| { | 3 | < 0.1% |
Close Punctuation
| Value | Count | Frequency (%) |
| ] | 8641 | |
| ) | 760 | 8.1% |
| } | 3 | < 0.1% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 1063519 | |
| – | 7 | < 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 2610242 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 110 |
Other Number
| Value | Count | Frequency (%) |
| ½ | 2 |
Format
| Value | Count | Frequency (%) |
| | 2 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 10865624 | |
| Latin | 3760325 | 25.7% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| u | 261683 | 7.0% |
| a | 255572 | 6.8% |
| r | 254995 | 6.8% |
| J | 251561 | 6.7% |
| e | 237935 | 6.3% |
| A | 224498 | 6.0% |
| n | 178794 | 4.8% |
| M | 175768 | 4.7% |
| c | 139538 | 3.7% |
| p | 138255 | 3.7% |
| Other values (46) | 1641726 |
Common
| Value | Count | Frequency (%) |
| (space) | 2610242 | |
| 1 | 1589790 | |
| 0 | 1327954 | |
| 9 | 1155041 | |
| - | 1063519 | |
| 2 | 604441 | 5.6% |
| 8 | 461134 | 4.2% |
| 6 | 404101 | 3.7% |
| 7 | 366280 | 3.4% |
| 3 | 329735 | 3.0% |
| Other values (31) | 953387 | 8.8% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 14625911 | |
| None | 29 | < 0.1% |
| Punctuation | 9 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| (space) | 2610242 | 17.8% |
| 1 | 1589790 | 10.9% |
| 0 | 1327954 | 9.1% |
| 9 | 1155041 | 7.9% |
| - | 1063519 | 7.3% |
| 2 | 604441 | 4.1% |
| 8 | 461134 | 3.2% |
| 6 | 404101 | 2.8% |
| 7 | 366280 | 2.5% |
| 3 | 329735 | 2.3% |
| Other values (78) | 4713674 |
None
| Value | Count | Frequency (%) |
| é | 14 | |
| û | 5 | 17.2% |
| ü | 3 | 10.3% |
| ä | 2 | 6.9% |
| ô | 2 | 6.9% |
| ½ | 2 | 6.9% |
| ± | 1 | 3.4% |
Punctuation
| Value | Count | Frequency (%) |
| – | 7 | |
| | 2 | 22.2% |
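The free-text dates sampled above mix letter case ("24 APR 1981", "6 Aug 1958") and include ranges ("23-29 January 1885"). As a rough sketch of how the simplest day-month-year form could be normalized, the hypothetical helper below parses a single day followed by an English month name or three-letter abbreviation and a four-digit year, and returns `None` for anything else. It is an illustration only, not the method used to build the `year`, `month` and `day` columns.

```python
import re
from datetime import date

_MONTH_NAMES = ["january", "february", "march", "april", "may", "june",
                "july", "august", "september", "october", "november", "december"]
# Accept full names and three-letter abbreviations, compared in lower case.
MONTHS = {}
for number, name in enumerate(_MONTH_NAMES, start=1):
    MONTHS[name] = number
    MONTHS[name[:3]] = number

PATTERN = re.compile(r"^\s*(\d{1,2})\s+([A-Za-z]+)\.?\s+(\d{4})\s*$")


def parse_day_month_year(text):
    """Parse strings such as '24 APR 1981'; return None if the form differs."""
    match = PATTERN.match(text)
    if match is None:
        return None  # ranges, slashes, times and other layouts are left alone
    day, month_word, year = match.groups()
    month = MONTHS.get(month_word.lower())
    if month is None:
        return None
    try:
        return date(int(year), month, int(day))
    except ValueError:
        return None  # e.g. a day number that does not exist in that month


parsed = [parse_day_month_year(s)
          for s in ["24 APR 1981", "6 Aug 1958", "23-29 January 1885"]]
print(parsed)
```

Returning `None` instead of guessing keeps ranges and the many other layouts in this field (note the `/`, `:` and `;` counts in the punctuation table) available for separate handling.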
habitat
Text
Missing 
| Distinct | 72844 |
|---|---|
| Distinct (%) | 39.6% |
| Missing | 2177646 |
| Missing (%) | 92.2% |
| Memory size | 18.0 MiB |
Length
| Max length | 795 |
|---|---|
| Median length | 504 |
| Mean length | 30.83063424 |
| Min length | 1 |
Unique
| Unique | 57211 |
|---|---|
| Unique (%) | 31.1% |
Sample
| 1st row | abandoned field |
|---|---|
| 2nd row | In wet mixed hardwood-pine-podocarpus forest. |
| 3rd row | Ecological remarks by collector(s): yes |
| 4th row | Rainforest |
| 5th row | Tropical dry forest |
| Value | Count | Frequency (%) |
| forest | 43320 | 5.0% |
| on | 24894 | 2.9% |
| and | 21426 | 2.5% |
| in | 20924 | 2.4% |
| with | 15194 | 1.8% |
| of | 14970 | 1.7% |
| by | 14737 | 1.7% |
| remarks | 12371 | 1.4% |
| ecological | 12371 | 1.4% |
| collector(s | 12367 | 1.4% |
| Other values (24035) | 672995 |
Most occurring characters
| Value | Count | Frequency (%) |
| (space) | 681742 | 12.0% |
| e | 506903 | 8.9% |
| a | 436200 | 7.7% |
| o | 418533 | 7.4% |
| r | 374871 | 6.6% |
| s | 360402 | 6.4% |
| n | 320370 | 5.7% |
| i | 282376 | 5.0% |
| t | 273985 | 4.8% |
| l | 250910 | 4.4% |
| Other values (121) | 1761211 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 4546359 | |
| Space Separator | 681742 | 12.0% |
| Uppercase Letter | 241805 | 4.3% |
| Other Punctuation | 139735 | 2.5% |
| Close Punctuation | 15495 | 0.3% |
| Open Punctuation | 15478 | 0.3% |
| Decimal Number | 14355 | 0.3% |
| Dash Punctuation | 11022 | 0.2% |
| Math Symbol | 1458 | < 0.1% |
| Other Symbol | 31 | < 0.1% |
| Other values (6) | 23 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 506903 | |
| a | 436200 | 9.6% |
| o | 418533 | 9.2% |
| r | 374871 | 8.2% |
| s | 360402 | 7.9% |
| n | 320370 | 7.0% |
| i | 282376 | 6.2% |
| t | 273985 | 6.0% |
| l | 250910 | 5.5% |
| d | 202477 | 4.5% |
| Other values (41) | 1119332 |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 27255 | 11.3% |
| E | 19870 | 8.2% |
| M | 19282 | 8.0% |
| C | 15226 | 6.3% |
| R | 14950 | 6.2% |
| P | 14353 | 5.9% |
| O | 13923 | 5.8% |
| F | 13595 | 5.6% |
| A | 13336 | 5.5% |
| T | 12982 | 5.4% |
| Other values (20) | 77033 |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 56768 | |
| . | 53985 | |
| : | 13864 | 9.9% |
| ; | 7795 | 5.6% |
| & | 2835 | 2.0% |
| / | 1969 | 1.4% |
| " | 1107 | 0.8% |
| ' | 725 | 0.5% |
| ? | 229 | 0.2% |
| % | 223 | 0.2% |
| Other values (6) | 235 | 0.2% |
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 4308 | |
| 1 | 1914 | |
| 2 | 1738 | |
| 3 | 1723 | 12.0% |
| 5 | 1686 | 11.7% |
| 4 | 1145 | 8.0% |
| 6 | 604 | 4.2% |
| 8 | 507 | 3.5% |
| 9 | 366 | 2.5% |
| 7 | 364 | 2.5% |
Math Symbol
| Value | Count | Frequency (%) |
| + | 731 | |
| ~ | 506 | |
| \| | 136 | 9.3% |
| ± | 41 | 2.8% |
| = | 31 | 2.1% |
| < | 8 | 0.5% |
| > | 5 | 0.3% |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 15265 | |
| ] | 168 | 1.1% |
| } | 62 | 0.4% |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 15251 | |
| [ | 165 | 1.1% |
| { | 62 | 0.4% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 11006 | |
| – | 8 | 0.1% |
| — | 8 | 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 681742 |
Other Symbol
| Value | Count | Frequency (%) |
| ° | 31 |
Other Letter
| Value | Count | Frequency (%) |
| º | 8 |
Final Punctuation
| Value | Count | Frequency (%) |
| ” | 6 |
Initial Punctuation
| Value | Count | Frequency (%) |
| “ | 4 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 3 |
Modifier Symbol
| Value | Count | Frequency (%) |
| ´ | 1 |
Other Number
| Value | Count | Frequency (%) |
| ² | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 4788172 | |
| Common | 879331 | 15.5% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 506903 | 10.6% |
| a | 436200 | 9.1% |
| o | 418533 | 8.7% |
| r | 374871 | 7.8% |
| s | 360402 | 7.5% |
| n | 320370 | 6.7% |
| i | 282376 | 5.9% |
| t | 273985 | 5.7% |
| l | 250910 | 5.2% |
| d | 202477 | 4.2% |
| Other values (72) | 1361145 |
Common
| Value | Count | Frequency (%) |
| (space) | 681742 | |
| , | 56768 | 6.5% |
| . | 53985 | 6.1% |
| ) | 15265 | 1.7% |
| ( | 15251 | 1.7% |
| : | 13864 | 1.6% |
| - | 11006 | 1.3% |
| ; | 7795 | 0.9% |
| 0 | 4308 | 0.5% |
| & | 2835 | 0.3% |
| Other values (39) | 16512 | 1.9% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 5659558 | |
| None | 7888 | 0.1% |
| Punctuation | 57 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| (space) | 681742 | 12.0% |
| e | 506903 | 9.0% |
| a | 436200 | 7.7% |
| o | 418533 | 7.4% |
| r | 374871 | 6.6% |
| s | 360402 | 6.4% |
| n | 320370 | 5.7% |
| i | 282376 | 5.0% |
| t | 273985 | 4.8% |
| l | 250910 | 4.4% |
| Other values (82) | 1753266 |
None
| Value | Count | Frequency (%) |
| ú | 1179 | |
| ê | 1157 | |
| é | 1124 | |
| ó | 1102 | |
| í | 913 | |
| á | 821 | |
| ñ | 640 | |
| è | 414 | 5.2% |
| à | 133 | 1.7% |
| ã | 61 | 0.8% |
| Other values (24) | 344 | 4.4% |
Punctuation
| Value | Count | Frequency (%) |
| … | 31 | |
| – | 8 | 14.0% |
| — | 8 | 14.0% |
| ” | 6 | 10.5% |
| “ | 4 | 7.0% |
samplingEffort
Text
Constant  Missing 
| Distinct | 1 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 2361472 |
| Missing (%) | > 99.9% |
| Memory size | 18.0 MiB |
Length
| Max length | 4 |
|---|---|
| Median length | 4 |
| Mean length | 4 |
| Min length | 4 |
Unique
| Unique | 1 |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | 67.0 |
|---|---|
| Value | Count | Frequency (%) |
| 67.0 | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| 6 | 1 | |
| 7 | 1 | |
| . | 1 | |
| 0 | 1 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 3 | |
| Other Punctuation | 1 | 25.0% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 6 | 1 | |
| 7 | 1 | |
| 0 | 1 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 4 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 6 | 1 | |
| 7 | 1 | |
| . | 1 | |
| 0 | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 4 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 6 | 1 | |
| 7 | 1 | |
| . | 1 | |
| 0 | 1 |
fieldNotes
Text
Constant  Missing 
| Distinct | 1 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 2361472 |
| Missing (%) | > 99.9% |
| Memory size | 18.0 MiB |
Length
| Max length | 5 |
|---|---|
| Median length | 5 |
| Mean length | 5 |
| Min length | 5 |
Unique
| Unique | 1 |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | -63.0 |
|---|---|
| Value | Count | Frequency (%) |
| 63.0 | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| - | 1 | |
| 6 | 1 | |
| 3 | 1 | |
| . | 1 | |
| 0 | 1 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 3 | |
| Dash Punctuation | 1 | 20.0% |
| Other Punctuation | 1 | 20.0% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 6 | 1 | |
| 3 | 1 | |
| 0 | 1 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 1 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 5 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| - | 1 | |
| 6 | 1 | |
| 3 | 1 | |
| . | 1 | |
| 0 | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 5 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| - | 1 | |
| 6 | 1 | |
| 3 | 1 | |
| . | 1 | |
| 0 | 1 |
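`samplingEffort` and `fieldNotes` each hold exactly one non-missing value out of 2361473 records ("67.0" and "-63.0"), and both look numeric rather than descriptive. One possible explanation, which the report itself does not state, is a single record whose fields were shifted during export. The sketch below shows one way to locate such records: it lists every column with exactly one non-missing value together with the row that holds it. The function name and the three sample rows are hypothetical; only the two values are taken from this report.

```python
def single_value_columns(rows, columns):
    """Map each column with exactly one non-missing value to (row_index, value)."""
    hits = {}
    for column in columns:
        found = [(index, row.get(column))
                 for index, row in enumerate(rows)
                 if row.get(column) not in (None, "")]
        if len(found) == 1:
            hits[column] = found[0]
    return hits


# Hypothetical rows; a real check would iterate over the downloaded records.
rows = [
    {"year": "1981", "samplingEffort": None, "fieldNotes": ""},
    {"year": "1952", "samplingEffort": "67.0", "fieldNotes": "-63.0"},
    {"year": "1958"},
]
suspects = single_value_columns(rows, ["year", "samplingEffort", "fieldNotes"])
print(suspects)
```

If several sparse columns point at the same row index, that record is a natural candidate for manual inspection.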
locationID
Text
Missing 
| Distinct | 50052 |
|---|---|
| Distinct (%) | 18.1% |
| Missing | 2084512 |
| Missing (%) | 88.3% |
| Memory size | 18.0 MiB |
Length
| Max length | 77799 |
|---|---|
| Median length | 131 |
| Mean length | 4.843338954 |
| Min length | 1 |
Unique
| Unique | 28095 |
|---|---|
| Unique (%) | 10.1% |
Sample
| 1st row | 31 |
|---|---|
| 2nd row | GS 03383 |
| 3rd row | M4 |
| 4th row | 9 |
| 5th row | 68-36 |
| Value | Count | Frequency (%) |
| d | 3566 | 1.1% |
| not | 3178 | 1.0% |
| rec | 3070 | 1.0% |
| 4 | 2339 | 0.7% |
| 1 | 2281 | 0.7% |
| rhb | 1929 | 0.6% |
| rfb | 1883 | 0.6% |
| 2 | 1847 | 0.6% |
| 3 | 1546 | 0.5% |
| 6 | 1528 | 0.5% |
| Other values (43774) | 294784 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 143532 | 10.7% |
| 2 | 119590 | 8.9% |
| 0 | 97991 | 7.3% |
| - | 87059 | 6.5% |
| 3 | 86532 | 6.5% |
| 5 | 86435 | 6.4% |
| 4 | 83135 | 6.2% |
| 6 | 74491 | 5.6% |
| 7 | 59619 | 4.4% |
| 8 | 54830 | 4.1% |
| Other values (89) | 448202 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 856082 | |
| Uppercase Letter | 265572 | 19.8% |
| Dash Punctuation | 87060 | 6.5% |
| Lowercase Letter | 52220 | 3.9% |
| Space Separator | 36726 | 2.7% |
| Other Punctuation | 24797 | 1.8% |
| Control | 13837 | 1.0% |
| Connector Punctuation | 2785 | 0.2% |
| Open Punctuation | 1126 | 0.1% |
| Close Punctuation | 1009 | 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 6785 | |
| t | 5566 | |
| o | 5269 | |
| e | 5025 | |
| i | 4772 | |
| n | 3806 | 7.3% |
| r | 3521 | 6.7% |
| l | 2693 | 5.2% |
| s | 2124 | 4.1% |
| u | 2087 | 4.0% |
| Other values (26) | 10572 |
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 27515 | 10.4% |
| S | 24535 | 9.2% |
| C | 21215 | 8.0% |
| B | 18453 | 6.9% |
| R | 17107 | 6.4% |
| M | 16994 | 6.4% |
| N | 16161 | 6.1% |
| E | 15064 | 5.7% |
| I | 13429 | 5.1% |
| T | 12946 | 4.9% |
| Other values (18) | 82153 |
Other Punctuation
| Value | Count | Frequency (%) |
| : | 10678 | |
| . | 8883 | |
| , | 2581 | 10.4% |
| / | 1586 | 6.4% |
| # | 425 | 1.7% |
| ; | 332 | 1.3% |
| & | 182 | 0.7% |
| ? | 70 | 0.3% |
| * | 30 | 0.1% |
| ' | 22 | 0.1% |
| Other values (2) | 8 | < 0.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 143532 | |
| 2 | 119590 | |
| 0 | 97991 | |
| 3 | 86532 | |
| 5 | 86435 | |
| 4 | 83135 | |
| 6 | 74491 | |
| 7 | 59619 | |
| 8 | 54830 | 6.4% |
| 9 | 49927 | 5.8% |
Math Symbol
| Value | Count | Frequency (%) |
| + | 196 | |
| = | 3 | 1.5% |
| \| | 3 | 1.5% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 87059 | |
| – | 1 | < 0.1% |
Control
| Value | Count | Frequency (%) |
| (unprintable control character) | 13775 | |
| (unprintable control character) | 62 | 0.4% |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 1047 | |
| [ | 79 | 7.0% |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 930 | |
| ] | 79 | 7.8% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 36726 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 2785 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 1023624 | |
| Latin | 317792 | 23.7% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| A | 27515 | 8.7% |
| S | 24535 | 7.7% |
| C | 21215 | 6.7% |
| B | 18453 | 5.8% |
| R | 17107 | 5.4% |
| M | 16994 | 5.3% |
| N | 16161 | 5.1% |
| E | 15064 | 4.7% |
| I | 13429 | 4.2% |
| T | 12946 | 4.1% |
| Other values (54) | 134373 |
Common
| Value | Count | Frequency (%) |
| 1 | 143532 | |
| 2 | 119590 | |
| 0 | 97991 | |
| - | 87059 | |
| 3 | 86532 | |
| 5 | 86435 | |
| 4 | 83135 | |
| 6 | 74491 | |
| 7 | 59619 | |
| 8 | 54830 | 5.4% |
| Other values (25) | 130410 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1341390 | |
| None | 25 | < 0.1% |
| Punctuation | 1 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 143532 | 10.7% |
| 2 | 119590 | 8.9% |
| 0 | 97991 | 7.3% |
| - | 87059 | 6.5% |
| 3 | 86532 | 6.5% |
| 5 | 86435 | 6.4% |
| 4 | 83135 | 6.2% |
| 6 | 74491 | 5.6% |
| 7 | 59619 | 4.4% |
| 8 | 54830 | 4.1% |
| Other values (76) | 448176 |
None
| Value | Count | Frequency (%) |
| é | 5 | |
| ü | 4 | |
| ä | 4 | |
| Ö | 2 | 8.0% |
| í | 2 | 8.0% |
| á | 2 | 8.0% |
| ã | 1 | 4.0% |
| å | 1 | 4.0% |
| ö | 1 | 4.0% |
| è | 1 | 4.0% |
| Other values (2) | 2 | 8.0% |
Punctuation
| Value | Count | Frequency (%) |
| – | 1 |
higherGeography
Text
Missing 
| Distinct | 48477 |
|---|---|
| Distinct (%) | 2.1% |
| Missing | 73521 |
| Missing (%) | 3.1% |
| Memory size | 18.0 MiB |
Length
| Max length | 177 |
|---|---|
| Median length | 138 |
| Mean length | 40.44184187 |
| Min length | 4 |
Unique
| Unique | 15741 |
|---|---|
| Unique (%) | 0.7% |
Sample
| 1st row | North Atlantic Ocean, Caribbean Sea, Belize |
|---|---|
| 2nd row | North America, United States, Tennessee |
| 3rd row | North America, United States, West Virginia, Randolph |
| 4th row | United States, Georgia, Decatur County |
| 5th row | North Atlantic Ocean, Gulf of Mexico, United States |
| Value | Count | Frequency (%) |
| america | 1138439 | 9.2% |
| north | 1106434 | 8.9% |
| united | 860922 | 6.9% |
| states | 853131 | 6.9% |
| (blank) | 440814 | 3.5% |
| south | 440485 | 3.5% |
| ocean | 430252 | 3.5% |
| neotropics | 407966 | 3.3% |
| atlantic | 224389 | 1.8% |
| pacific | 213984 | 1.7% |
| Other values (16557) | 6300876 |
Most occurring characters
| Value | Count | Frequency (%) |
| (space) | 10129740 | 10.9% |
| a | 8943831 | 9.7% |
| i | 6761301 | 7.3% |
| e | 6630309 | 7.2% |
| t | 6497959 | 7.0% |
| r | 5098061 | 5.5% |
| o | 4988716 | 5.4% |
| , | 4762006 | 5.1% |
| n | 4606108 | 5.0% |
| c | 3725697 | 4.0% |
| Other values (165) | 30385265 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 64845487 | |
| Uppercase Letter | 12007518 | 13.0% |
| Space Separator | 10129740 | 10.9% |
| Other Punctuation | 4831886 | 5.2% |
| Dash Punctuation | 597716 | 0.6% |
| Open Punctuation | 58149 | 0.1% |
| Close Punctuation | 58139 | 0.1% |
| Modifier Letter | 149 | < 0.1% |
| Math Symbol | 90 | < 0.1% |
| Decimal Number | 73 | < 0.1% |
| Other values (2) | 46 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 8943831 | |
| i | 6761301 | |
| e | 6630309 | |
| t | 6497959 | |
| r | 5098061 | |
| o | 4988716 | |
| n | 4606108 | 7.1% |
| c | 3725697 | 5.7% |
| s | 3264213 | 5.0% |
| l | 2250917 | 3.5% |
| Other values (82) | 12078375 |
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 2049264 | |
| N | 1767234 | |
| S | 1721940 | |
| U | 927441 | |
| C | 879326 | |
| P | 645016 | 5.4% |
| M | 559337 | 4.7% |
| O | 533657 | 4.4% |
| I | 403326 | 3.4% |
| T | 335754 | 2.8% |
| Other values (37) | 2185223 |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 4762006 | |
| . | 43825 | 0.9% |
| ' | 17324 | 0.4% |
| / | 6660 | 0.1% |
| ? | 1684 | < 0.1% |
| ; | 292 | < 0.1% |
| & | 39 | < 0.1% |
| * | 27 | < 0.1% |
| : | 24 | < 0.1% |
| " | 2 | < 0.1% |
| Other values (2) | 3 | < 0.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 3 | 18 | |
| 2 | 18 | |
| 1 | 17 | |
| 0 | 9 | |
| 4 | 4 | 5.5% |
| 8 | 2 | 2.7% |
| 6 | 2 | 2.7% |
| 9 | 2 | 2.7% |
| 7 | 1 | 1.4% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 597579 | |
| – | 136 | < 0.1% |
| — | 1 | < 0.1% |
Open Punctuation
| Value | Count | Frequency (%) |
| [ | 47065 | |
| ( | 11084 | 19.1% |
Close Punctuation
| Value | Count | Frequency (%) |
| ] | 47054 | |
| ) | 11085 | 19.1% |
Modifier Letter
| Value | Count | Frequency (%) |
| ʻ | 128 | |
| ʼ | 21 | 14.1% |
Math Symbol
| Value | Count | Frequency (%) |
| = | 87 | |
| + | 3 | 3.3% |
Modifier Symbol
| Value | Count | Frequency (%) |
| ´ | 34 | |
| ¸ | 1 | 2.9% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 10129740 |
Format
| Value | Count | Frequency (%) |
| | 11 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 76853005 | |
| Common | 15675988 | 16.9% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 8943831 | 11.6% |
| i | 6761301 | 8.8% |
| e | 6630309 | 8.6% |
| t | 6497959 | 8.5% |
| r | 5098061 | 6.6% |
| o | 4988716 | 6.5% |
| n | 4606108 | 6.0% |
| c | 3725697 | 4.8% |
| s | 3264213 | 4.2% |
| l | 2250917 | 2.9% |
| Other values (129) | 24085893 |
Common
| Value | Count | Frequency (%) |
| (space) | 10129740 | |
| , | 4762006 | |
| - | 597579 | 3.8% |
| [ | 47065 | 0.3% |
| ] | 47054 | 0.3% |
| . | 43825 | 0.3% |
| ' | 17324 | 0.1% |
| ) | 11085 | 0.1% |
| ( | 11084 | 0.1% |
| / | 6660 | < 0.1% |
| Other values (26) | 2566 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 92396075 | |
| None | 132576 | 0.1% |
| Modifier Letters | 149 | < 0.1% |
| Punctuation | 148 | < 0.1% |
| Latin Ext Additional | 45 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| (space) | 10129740 | 11.0% |
| a | 8943831 | 9.7% |
| i | 6761301 | 7.3% |
| e | 6630309 | 7.2% |
| t | 6497959 | 7.0% |
| r | 5098061 | 5.5% |
| o | 4988716 | 5.4% |
| , | 4762006 | 5.2% |
| n | 4606108 | 5.0% |
| c | 3725697 | 4.0% |
| Other values (70) | 30252347 |
None
| Value | Count | Frequency (%) |
| á | 42738 | |
| í | 24913 | |
| é | 22834 | |
| ó | 16557 | 12.5% |
| ã | 8603 | 6.5% |
| ô | 3849 | 2.9% |
| ç | 2216 | 1.7% |
| ñ | 2003 | 1.5% |
| Î | 1675 | 1.3% |
| ü | 1625 | 1.2% |
| Other values (67) | 5563 | 4.2% |
Punctuation
| Value | Count | Frequency (%) |
| – | 136 | |
| | 11 | 7.4% |
| — | 1 | 0.7% |
Modifier Letters
| Value | Count | Frequency (%) |
| ʻ | 128 | |
| ʼ | 21 | 14.1% |
Latin Ext Additional
| Value | Count | Frequency (%) |
| ả | 10 | |
| ị | 8 | |
| ộ | 5 | |
| ố | 4 | 8.9% |
| ế | 3 | 6.7% |
| ừ | 3 | 6.7% |
| ḍ | 3 | 6.7% |
| ṭ | 3 | 6.7% |
| ậ | 2 | 4.4% |
| ẵ | 1 | 2.2% |
| Other values (3) | 3 | 6.7% |
continent
Text
Missing 
| Distinct | 7 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 411637 |
| Missing (%) | 17.4% |
| Memory size | 18.0 MiB |
Length
| Max length | 13 |
|---|---|
| Median length | 13 |
| Mean length | 10.839028 |
| Min length | 4 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | NORTH_AMERICA |
|---|---|
| 2nd row | NORTH_AMERICA |
| 3rd row | NORTH_AMERICA |
| 4th row | NORTH_AMERICA |
| 5th row | ASIA |
| Value | Count | Frequency (%) |
| north_america | 1041974 | |
| south_america | 357651 | 18.3% |
| asia | 249517 | 12.8% |
| oceania | 115158 | 5.9% |
| africa | 104098 | 5.3% |
| europe | 75985 | 3.9% |
| antarctica | 5453 | 0.3% |
Most occurring characters
| Value | Count | Frequency (%) |
| A | 3753155 | |
| R | 2627135 | |
| I | 1873851 | |
| E | 1666753 | |
| C | 1629787 | |
| O | 1590768 | |
| T | 1410531 | 6.7% |
| H | 1399625 | 6.6% |
| _ | 1399625 | 6.6% |
| M | 1399625 | 6.6% |
| Other values (5) | 2383472 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 19734702 | |
| Connector Punctuation | 1399625 | 6.6% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 3753155 | |
| R | 2627135 | |
| I | 1873851 | |
| E | 1666753 | |
| C | 1629787 | |
| O | 1590768 | |
| T | 1410531 | 7.1% |
| H | 1399625 | 7.1% |
| M | 1399625 | 7.1% |
| N | 1162585 | 5.9% |
| Other values (4) | 1220887 | 6.2% |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 1399625 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 19734702 | |
| Common | 1399625 | 6.6% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| A | 3753155 | |
| R | 2627135 | |
| I | 1873851 | |
| E | 1666753 | |
| C | 1629787 | |
| O | 1590768 | |
| T | 1410531 | 7.1% |
| H | 1399625 | 7.1% |
| M | 1399625 | 7.1% |
| N | 1162585 | 5.9% |
| Other values (4) | 1220887 | 6.2% |
Common
| Value | Count | Frequency (%) |
| _ | 1399625 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 21134327 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| A | 3753155 | |
| R | 2627135 | |
| I | 1873851 | |
| E | 1666753 | |
| C | 1629787 | |
| O | 1590768 | |
| T | 1410531 | 6.7% |
| H | 1399625 | 6.6% |
| _ | 1399625 | 6.6% |
| M | 1399625 | 6.6% |
| Other values (5) | 2383472 |
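Because `continent` has only seven distinct values, its character, category and block totals can be reproduced from the value counts alone. The sketch below does this with the standard library. It is an independent cross-check under the assumption that the stored values are the upper-case forms shown in the sample rows; it is not the profiling tool's own implementation.

```python
import unicodedata
from collections import Counter

# Value counts from the table above, in the upper-case form of the sample rows.
VALUE_COUNTS = {
    "NORTH_AMERICA": 1041974,
    "SOUTH_AMERICA": 357651,
    "ASIA": 249517,
    "OCEANIA": 115158,
    "AFRICA": 104098,
    "EUROPE": 75985,
    "ANTARCTICA": 5453,
}

# Weight every character of a value by how many records carry that value.
char_counts = Counter()
for value, n_records in VALUE_COUNTS.items():
    for ch in value:
        char_counts[ch] += n_records

# Group characters by Unicode general category ("Lu" = uppercase letter,
# "Pc" = connector punctuation).
category_counts = Counter()
for ch, n in char_counts.items():
    category_counts[unicodedata.category(ch)] += n

total_chars = sum(char_counts.values())
print(char_counts.most_common(3), dict(category_counts), total_chars)
```

The results match the report: 3753155 occurrences of `A`, 1399625 underscores (one per `NORTH_AMERICA` or `SOUTH_AMERICA` record), 19734702 uppercase letters, and 21134327 characters overall.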
waterBody
Text
Missing 
| Distinct | 2466 |
|---|---|
| Distinct (%) | 0.6% |
| Missing | 1923759 |
| Missing (%) | 81.5% |
| Memory size | 18.0 MiB |
Length
| Max length | 75 |
|---|---|
| Median length | 73 |
| Mean length | 24.15323019 |
| Min length | 6 |
Unique
| Unique | 1005 |
|---|---|
| Unique (%) | 0.2% |
Sample
| 1st row | North Atlantic Ocean, Caribbean Sea |
|---|---|
| 2nd row | North Atlantic Ocean, Gulf of Mexico |
| 3rd row | North Atlantic Ocean, Gulf of Mexico, Galveston Bay |
| 4th row | North Pacific Ocean, Gulf of California |
| 5th row | North Atlantic Ocean, Gulf of Guinea |
| Value | Count | Frequency (%) |
| ocean | 429243 | |
| north | 326216 | |
| atlantic | 224090 | |
| pacific | 174691 | |
| of | 70763 | 4.3% |
| sea | 70402 | 4.3% |
| gulf | 69641 | 4.2% |
| south | 61315 | 3.7% |
| mexico | 54265 | 3.3% |
| caribbean | 31788 | 1.9% |
| Other values (1777) | 138016 | 8.4% |
Most occurring characters
| Value | Count | Frequency (%) |
| (space) | 1212716 | 11.5% |
| a | 1102411 | |
| c | 1096281 | |
| t | 883208 | 8.4% |
| n | 794119 | 7.5% |
| i | 741797 | 7.0% |
| e | 632056 | 6.0% |
| o | 541092 | 5.1% |
| O | 432028 | 4.1% |
| r | 407634 | 3.9% |
| Other values (63) | 2728865 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 7617530 | |
| Uppercase Letter | 1581582 | 15.0% |
| Space Separator | 1212716 | 11.5% |
| Other Punctuation | 159679 | 1.5% |
| Dash Punctuation | 534 | < 0.1% |
| Modifier Letter | 122 | < 0.1% |
| Open Punctuation | 22 | < 0.1% |
| Close Punctuation | 22 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 1102411 | |
| c | 1096281 | |
| t | 883208 | |
| n | 794119 | |
| i | 741797 | |
| e | 632056 | |
| o | 541092 | |
| r | 407634 | 5.4% |
| h | 400540 | 5.3% |
| f | 319349 | 4.2% |
| Other values (23) | 699043 |
Uppercase Letter
| Value | Count | Frequency (%) |
| O | 432028 | |
| N | 327077 | |
| A | 243543 | |
| P | 180869 | |
| S | 145435 | 9.2% |
| G | 71397 | 4.5% |
| M | 64603 | 4.1% |
| C | 46516 | 2.9% |
| B | 26617 | 1.7% |
| I | 22994 | 1.5% |
| Other values (16) | 20503 | 1.3% |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 158850 | |
| ; | 291 | 0.2% |
| ' | 213 | 0.1% |
| . | 161 | 0.1% |
| / | 124 | 0.1% |
| ? | 21 | < 0.1% |
| : | 19 | < 0.1% |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 21 | |
| [ | 1 | 4.5% |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 21 | |
| ] | 1 | 4.5% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 1212716 | 100.0% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 534 | 100.0% |
Modifier Letter
| Value | Count | Frequency (%) |
| ʻ | 122 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 9199112 | |
| Common | 1373095 | 13.0% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 1102411 | |
| c | 1096281 | |
| t | 883208 | |
| n | 794119 | 8.6% |
| i | 741797 | 8.1% |
| e | 632056 | 6.9% |
| o | 541092 | 5.9% |
| O | 432028 | 4.7% |
| r | 407634 | 4.4% |
| h | 400540 | 4.4% |
| Other values (49) | 2167946 |
Common
| Value | Count | Frequency (%) |
| (space) | 1212716 | |
| , | 158850 | 11.6% |
| - | 534 | < 0.1% |
| ; | 291 | < 0.1% |
| ' | 213 | < 0.1% |
| . | 161 | < 0.1% |
| / | 124 | < 0.1% |
| ʻ | 122 | < 0.1% |
| ( | 21 | < 0.1% |
| ) | 21 | < 0.1% |
| Other values (4) | 42 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 10571819 | |
| None | 266 | < 0.1% |
| Modifier Letters | 122 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| (space) | 1212716 | |
| a | 1102411 | |
| c | 1096281 | |
| t | 883208 | 8.4% |
| n | 794119 | 7.5% |
| i | 741797 | 7.0% |
| e | 632056 | 6.0% |
| o | 541092 | 5.1% |
| O | 432028 | 4.1% |
| r | 407634 | 3.9% |
| Other values (54) | 2728477 |
Modifier Letters
| Value | Count | Frequency (%) |
| ʻ | 122 | 100.0% |
None
| Value | Count | Frequency (%) |
| ā | 122 | |
| í | 57 | |
| á | 33 | 12.4% |
| ñ | 23 | 8.6% |
| ó | 13 | 4.9% |
| é | 12 | 4.5% |
| è | 5 | 1.9% |
| É | 1 | 0.4% |
islandGroup
Text
Missing 
| Distinct | 655 |
|---|---|
| Distinct (%) | 1.3% |
| Missing | 2309219 |
| Missing (%) | 97.8% |
| Memory size | 18.0 MiB |
Length
| Max length | 45 |
|---|---|
| Median length | 41 |
| Mean length | 14.63855399 |
| Min length | 4 |
Unique
| Unique | 152 |
|---|---|
| Unique (%) | 0.3% |
Sample
| 1st row | Pelican Cays |
|---|---|
| 2nd row | Greater Antilles |
| 3rd row | Stewart Islands |
| 4th row | Ralik Chain |
| 5th row | Virgin Islands |
| Value | Count | Frequency (%) |
| islands | 18220 | 16.0% |
| antilles | 8706 | 7.7% |
| greater | 8542 | 7.5% |
| group | 7726 | 6.8% |
| is | 5006 | 4.4% |
| leeward | 2799 | 2.5% |
| new | 2396 | 2.1% |
| hispaniola | 2301 | 2.0% |
| chain | 2114 | 1.9% |
| virgin | 1728 | 1.5% |
| Other values (552) | 54220 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 89318 | 11.7% |
| s | 68956 | 9.0% |
| (space) | 61504 | 8.0% |
| n | 56352 | 7.4% |
| l | 54957 | 7.2% |
| e | 53661 | 7.0% |
| r | 46030 | 6.0% |
| i | 39007 | 5.1% |
| d | 31951 | 4.2% |
| t | 28112 | 3.7% |
| Other values (59) | 235075 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 583346 | |
| Uppercase Letter | 112247 | 14.7% |
| Space Separator | 61504 | 8.0% |
| Other Punctuation | 5467 | 0.7% |
| Open Punctuation | 1168 | 0.2% |
| Close Punctuation | 1168 | 0.2% |
| Dash Punctuation | 11 | < 0.1% |
| Format | 11 | < 0.1% |
| Math Symbol | 1 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 89318 | |
| s | 68956 | |
| n | 56352 | |
| l | 54957 | |
| e | 53661 | |
| r | 46030 | |
| i | 39007 | |
| d | 31951 | 5.5% |
| t | 28112 | 4.8% |
| o | 25531 | 4.4% |
| Other values (20) | 89471 |
Uppercase Letter
| Value | Count | Frequency (%) |
| I | 25086 | |
| G | 19795 | |
| A | 10876 | |
| C | 8571 | 7.6% |
| V | 5998 | 5.3% |
| L | 5665 | 5.0% |
| S | 5499 | 4.9% |
| B | 4199 | 3.7% |
| N | 3529 | 3.1% |
| R | 3409 | 3.0% |
| Other values (17) | 19620 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 5004 | |
| ' | 455 | 8.3% |
| , | 6 | 0.1% |
| ? | 2 | < 0.1% |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 692 | |
| [ | 476 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 692 | |
| ] | 476 |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 61504 | 100.0% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 11 | 100.0% |
Format
| Value | Count | Frequency (%) |
| | 11 |
Math Symbol
| Value | Count | Frequency (%) |
| = | 1 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 695593 | |
| Common | 69330 | 9.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 89318 | |
| s | 68956 | 9.9% |
| n | 56352 | 8.1% |
| l | 54957 | 7.9% |
| e | 53661 | 7.7% |
| r | 46030 | 6.6% |
| i | 39007 | 5.6% |
| d | 31951 | 4.6% |
| t | 28112 | 4.0% |
| o | 25531 | 3.7% |
| Other values (47) | 201718 |
Common
| Value | Count | Frequency (%) |
| (space) | 61504 | |
| . | 5004 | 7.2% |
| ( | 692 | 1.0% |
| ) | 692 | 1.0% |
| ] | 476 | 0.7% |
| [ | 476 | 0.7% |
| ' | 455 | 0.7% |
| - | 11 | < 0.1% |
| | 11 | < 0.1% |
| , | 6 | < 0.1% |
| Other values (2) | 3 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 762646 | |
| None | 2266 | 0.3% |
| Punctuation | 11 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 89318 | 11.7% |
| s | 68956 | 9.0% |
| (space) | 61504 | 8.1% |
| n | 56352 | 7.4% |
| l | 54957 | 7.2% |
| e | 53661 | 7.0% |
| r | 46030 | 6.0% |
| i | 39007 | 5.1% |
| d | 31951 | 4.2% |
| t | 28112 | 3.7% |
| Other values (52) | 232798 |
None
| Value | Count | Frequency (%) |
| Î | 1196 | |
| á | 1048 | |
| Ō | 16 | 0.7% |
| ñ | 4 | 0.2% |
| ù | 1 | < 0.1% |
| à | 1 | < 0.1% |
Punctuation
| Value | Count | Frequency (%) |
| | 11 |
island
Text
Missing 
| Distinct | 4075 |
|---|---|
| Distinct (%) | 2.6% |
| Missing | 2204401 |
| Missing (%) | 93.3% |
| Memory size | 18.0 MiB |
Length
| Max length | 47 |
|---|---|
| Median length | 41 |
| Mean length | 9.542050779 |
| Min length | 3 |
Unique
| Unique | 1250 |
|---|---|
| Unique (%) | 0.8% |
Sample
| 1st row | Honshu |
|---|---|
| 2nd row | Lana'i |
| 3rd row | Cat Cay |
| 4th row | Hawaii |
| 5th row | Sumatra |
| Value | Count | Frequency (%) |
| island | 26207 | 10.9% |
| hispaniola | 12814 | 5.3% |
| cuba | 6496 | 2.7% |
| oahu | 6126 | 2.5% |
| atoll | 5648 | 2.3% |
| luzon | 5340 | 2.2% |
| new | 4804 | 2.0% |
| bermuda | 4124 | 1.7% |
| guinea | 3811 | 1.6% |
| st | 3730 | 1.5% |
| Other values (3177) | 162065 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 230929 | |
| n | 107175 | 7.2% |
| i | 100018 | 6.7% |
| o | 93269 | 6.2% |
| (space) | 84093 | 5.6% |
| l | 83151 | 5.5% |
| e | 75481 | 5.0% |
| u | 73701 | 4.9% |
| s | 67954 | 4.5% |
| r | 59918 | 4.0% |
| Other values (77) | 523100 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 1155574 | |
| Uppercase Letter | 236803 | 15.8% |
| Space Separator | 84093 | 5.6% |
| Other Punctuation | 11708 | 0.8% |
| Close Punctuation | 4956 | 0.3% |
| Open Punctuation | 4953 | 0.3% |
| Dash Punctuation | 695 | < 0.1% |
| Decimal Number | 6 | < 0.1% |
| Modifier Letter | 1 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 230929 | |
| n | 107175 | |
| i | 100018 | |
| o | 93269 | |
| l | 83151 | 7.2% |
| e | 75481 | 6.5% |
| u | 73701 | 6.4% |
| s | 67954 | 5.9% |
| r | 59918 | 5.2% |
| d | 53485 | 4.6% |
| Other values (33) | 210493 |
Uppercase Letter
| Value | Count | Frequency (%) |
| I | 33564 | |
| C | 22392 | 9.5% |
| H | 22087 | 9.3% |
| B | 17981 | 7.6% |
| S | 17953 | 7.6% |
| M | 15275 | 6.5% |
| T | 11260 | 4.8% |
| A | 10790 | 4.6% |
| G | 10713 | 4.5% |
| L | 10554 | 4.5% |
| Other values (18) | 64234 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 5656 | |
| ' | 5655 | |
| , | 354 | 3.0% |
| ? | 34 | 0.3% |
| / | 9 | 0.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 2 | |
| 3 | 2 | |
| 2 | 1 | |
| 6 | 1 |
Close Punctuation
| Value | Count | Frequency (%) |
| ] | 3618 | |
| ) | 1338 | 27.0% |
Open Punctuation
| Value | Count | Frequency (%) |
| [ | 3618 | |
| ( | 1335 | 27.0% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 84093 | 100.0% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 695 | 100.0% |
Modifier Letter
| Value | Count | Frequency (%) |
| ʻ | 1 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1392377 | |
| Common | 106412 | 7.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 230929 | |
| n | 107175 | 7.7% |
| i | 100018 | 7.2% |
| o | 93269 | 6.7% |
| l | 83151 | 6.0% |
| e | 75481 | 5.4% |
| u | 73701 | 5.3% |
| s | 67954 | 4.9% |
| r | 59918 | 4.3% |
| d | 53485 | 3.8% |
| Other values (61) | 447296 |
Common
| Value | Count | Frequency (%) |
| (space) | 84093 | |
| . | 5656 | 5.3% |
| ' | 5655 | 5.3% |
| ] | 3618 | 3.4% |
| [ | 3618 | 3.4% |
| ) | 1338 | 1.3% |
| ( | 1335 | 1.3% |
| - | 695 | 0.7% |
| , | 354 | 0.3% |
| ? | 34 | < 0.1% |
| Other values (6) | 16 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1496976 | |
| None | 1808 | 0.1% |
| Latin Ext Additional | 4 | < 0.1% |
| Modifier Letters | 1 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 230929 | |
| n | 107175 | 7.2% |
| i | 100018 | 6.7% |
| o | 93269 | 6.2% |
| (space) | 84093 | 5.6% |
| l | 83151 | 5.6% |
| e | 75481 | 5.0% |
| u | 73701 | 4.9% |
| s | 67954 | 4.5% |
| r | 59918 | 4.0% |
| Other values (56) | 521287 |
None
| Value | Count | Frequency (%) |
| ç | 458 | |
| Î | 396 | |
| ó | 249 | |
| é | 247 | |
| á | 175 | 9.7% |
| â | 101 | 5.6% |
| ñ | 69 | 3.8% |
| ã | 48 | 2.7% |
| í | 17 | 0.9% |
| Ö | 14 | 0.8% |
| Other values (9) | 34 | 1.9% |
Latin Ext Additional
| Value | Count | Frequency (%) |
| ố | 4 | 100.0% |
Modifier Letters
| Value | Count | Frequency (%) |
| ʻ | 1 | 100.0% |
countryCode
Text
Missing 
| Distinct | 247 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 95309 |
| Missing (%) | 4.0% |
| Memory size | 18.0 MiB |
Length
| Max length | 2 |
|---|---|
| Median length | 2 |
| Mean length | 2 |
| Min length | 2 |
Unique
| Unique | 2 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | BZ |
|---|---|
| 2nd row | US |
| 3rd row | US |
| 4th row | US |
| 5th row | US |
| Value | Count | Frequency (%) |
| us | 845741 | |
| mx | 117278 | 5.2% |
| br | 95213 | 4.2% |
| ph | 68631 | 3.0% |
| co | 59012 | 2.6% |
| ca | 50585 | 2.2% |
| pa | 48976 | 2.2% |
| ve | 43923 | 1.9% |
| cn | 40185 | 1.8% |
| pe | 39643 | 1.7% |
| Other values (237) | 856977 |
Most occurring characters
| Value | Count | Frequency (%) |
| U | 922314 | |
| S | 901987 | |
| C | 277772 | 6.1% |
| P | 257294 | 5.7% |
| M | 212476 | 4.7% |
| R | 196295 | 4.3% |
| A | 194321 | 4.3% |
| B | 172654 | 3.8% |
| E | 158329 | 3.5% |
| H | 122627 | 2.7% |
| Other values (16) | 1116259 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 4532328 | 100.0% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| U | 922314 | |
| S | 901987 | |
| C | 277772 | 6.1% |
| P | 257294 | 5.7% |
| M | 212476 | 4.7% |
| R | 196295 | 4.3% |
| A | 194321 | 4.3% |
| B | 172654 | 3.8% |
| E | 158329 | 3.5% |
| H | 122627 | 2.7% |
| Other values (16) | 1116259 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 4532328 | 100.0% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| U | 922314 | |
| S | 901987 | |
| C | 277772 | 6.1% |
| P | 257294 | 5.7% |
| M | 212476 | 4.7% |
| R | 196295 | 4.3% |
| A | 194321 | 4.3% |
| B | 172654 | 3.8% |
| E | 158329 | 3.5% |
| H | 122627 | 2.7% |
| Other values (16) | 1116259 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 4532328 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| U | 922314 | |
| S | 901987 | |
| C | 277772 | 6.1% |
| P | 257294 | 5.7% |
| M | 212476 | 4.7% |
| R | 196295 | 4.3% |
| A | 194321 | 4.3% |
| B | 172654 | 3.8% |
| E | 158329 | 3.5% |
| H | 122627 | 2.7% |
| Other values (16) | 1116259 |
stateProvince
Text
Missing 
| Distinct | 7056 |
|---|---|
| Distinct (%) | 0.4% |
| Missing | 637065 |
| Missing (%) | 27.0% |
| Memory size | 18.0 MiB |
Length
| Max length | 69 |
|---|---|
| Median length | 52 |
| Mean length | 9.275850611 |
| Min length | 1 |
Unique
| Unique | 1731 |
|---|---|
| Unique (%) | 0.1% |
Sample
| 1st row | Tennessee |
|---|---|
| 2nd row | West Virginia |
| 3rd row | Georgia |
| 4th row | Maine |
| 5th row | Texas |
| Value | Count | Frequency (%) |
| california | 92210 | 4.0% |
| florida | 79202 | 3.5% |
| virginia | 63444 | 2.8% |
| carolina | 49684 | 2.2% |
| new | 49642 | 2.2% |
| north | 41844 | 1.8% |
| texas | 40438 | 1.8% |
| alaska | 39419 | 1.7% |
| massachusetts | 36351 | 1.6% |
| maryland | 30762 | 1.3% |
| Other values (5148) | 1769153 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 2411525 | |
| i | 1368936 | 8.6% |
| n | 1172665 | 7.3% |
| o | 1168862 | 7.3% |
| r | 1039411 | 6.5% |
| e | 843437 | 5.3% |
| s | 754678 | 4.7% |
| l | 682299 | 4.3% |
| t | 604620 | 3.8% |
| (space) | 567741 | 3.5% |
| Other values (140) | 5381177 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 13046442 | |
| Uppercase Letter | 2281249 | 14.3% |
| Space Separator | 567741 | 3.5% |
| Dash Punctuation | 43849 | 0.3% |
| Other Punctuation | 29327 | 0.2% |
| Open Punctuation | 13313 | 0.1% |
| Close Punctuation | 13311 | 0.1% |
| Math Symbol | 70 | < 0.1% |
| Decimal Number | 27 | < 0.1% |
| Modifier Letter | 21 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 2411525 | |
| i | 1368936 | |
| n | 1172665 | |
| o | 1168862 | |
| r | 1039411 | |
| e | 843437 | 6.5% |
| s | 754678 | 5.8% |
| l | 682299 | 5.2% |
| t | 604620 | 4.6% |
| u | 477899 | 3.7% |
| Other values (72) | 2522110 |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 328285 | |
| M | 227765 | 10.0% |
| N | 175545 | 7.7% |
| S | 173673 | 7.6% |
| A | 166763 | 7.3% |
| P | 129861 | 5.7% |
| T | 113347 | 5.0% |
| V | 100836 | 4.4% |
| F | 96243 | 4.2% |
| B | 78697 | 3.4% |
| Other values (33) | 690234 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 19409 | |
| / | 3917 | 13.4% |
| ' | 3030 | 10.3% |
| , | 2241 | 7.6% |
| ? | 702 | 2.4% |
| & | 27 | 0.1% |
| * | 1 | < 0.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 3 | 16 | |
| 4 | 3 | 11.1% |
| 2 | 2 | 7.4% |
| 8 | 2 | 7.4% |
| 9 | 2 | 7.4% |
| 6 | 1 | 3.7% |
| 7 | 1 | 3.7% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 43838 | |
| – | 11 | < 0.1% |
Open Punctuation
| Value | Count | Frequency (%) |
| [ | 7637 | |
| ( | 5676 |
Close Punctuation
| Value | Count | Frequency (%) |
| ] | 7636 | |
| ) | 5675 |
Math Symbol
| Value | Count | Frequency (%) |
| = | 68 | |
| + | 2 | 2.9% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 567741 | 100.0% |
Modifier Letter
| Value | Count | Frequency (%) |
| ʼ | 21 | 100.0% |
Modifier Symbol
| Value | Count | Frequency (%) |
| ¸ | 1 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 15327691 | |
| Common | 667660 | 4.2% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 2411525 | |
| i | 1368936 | 8.9% |
| n | 1172665 | 7.7% |
| o | 1168862 | 7.6% |
| r | 1039411 | 6.8% |
| e | 843437 | 5.5% |
| s | 754678 | 4.9% |
| l | 682299 | 4.5% |
| t | 604620 | 3.9% |
| u | 477899 | 3.1% |
| Other values (115) | 4803359 |
Common
| Value | Count | Frequency (%) |
| (space) | 567741 | |
| - | 43838 | 6.6% |
| . | 19409 | 2.9% |
| [ | 7637 | 1.1% |
| ] | 7636 | 1.1% |
| ( | 5676 | 0.9% |
| ) | 5675 | 0.8% |
| / | 3917 | 0.6% |
| ' | 3030 | 0.5% |
| , | 2241 | 0.3% |
| Other values (15) | 860 | 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 15889050 | |
| None | 106241 | 0.7% |
| Latin Ext Additional | 28 | < 0.1% |
| Modifier Letters | 21 | < 0.1% |
| Punctuation | 11 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 2411525 | |
| i | 1368936 | 8.6% |
| n | 1172665 | 7.4% |
| o | 1168862 | 7.4% |
| r | 1039411 | 6.5% |
| e | 843437 | 5.3% |
| s | 754678 | 4.7% |
| l | 682299 | 4.3% |
| t | 604620 | 3.8% |
| (space) | 567741 | 3.6% |
| Other values (64) | 5274876 |
None
| Value | Count | Frequency (%) |
| á | 37610 | |
| í | 21537 | |
| é | 17350 | |
| ó | 12736 | 12.0% |
| ã | 6488 | 6.1% |
| ô | 3458 | 3.3% |
| ñ | 1592 | 1.5% |
| ü | 1325 | 1.2% |
| ä | 729 | 0.7% |
| å | 578 | 0.5% |
| Other values (53) | 2838 | 2.7% |
Modifier Letters
| Value | Count | Frequency (%) |
| ʼ | 21 | 100.0% |
Punctuation
| Value | Count | Frequency (%) |
| – | 11 | 100.0% |
Latin Ext Additional
| Value | Count | Frequency (%) |
| ị | 8 | |
| ế | 3 | 10.7% |
| ừ | 3 | 10.7% |
| ḍ | 3 | 10.7% |
| ṭ | 3 | 10.7% |
| ộ | 2 | 7.1% |
| ậ | 2 | 7.1% |
| ằ | 1 | 3.6% |
| ẵ | 1 | 3.6% |
| ḑ | 1 | 3.6% |
county
Text
Missing 
| Distinct | 13641 |
|---|---|
| Distinct (%) | 2.5% |
| Missing | 1825433 |
| Missing (%) | 77.3% |
| Memory size | 18.0 MiB |
Length
| Max length | 56 |
|---|---|
| Median length | 45 |
| Mean length | 10.24671853 |
| Min length | 1 |
Unique
| Unique | 4083 |
|---|---|
| Unique (%) | 0.8% |
Sample
| 1st row | Randolph |
|---|---|
| 2nd row | Decatur County |
| 3rd row | Penobscot |
| 4th row | Galveston County |
| 5th row | Dona Ana |
| Value | Count | Frequency (%) |
| county | 89628 | 10.8% |
| not | 33487 | 4.0% |
| stated | 33487 | 4.0% |
| san | 13117 | 1.6% |
| prince | 8967 | 1.1% |
| montgomery | 8280 | 1.0% |
| district | 8120 | 1.0% |
| santa | 7340 | 0.9% |
| honolulu | 7298 | 0.9% |
| | 6994 | 0.8% |
| Other values (9747) | 611161 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 521113 | 9.5% |
| o | 447668 | 8.2% |
| n | 426872 | 7.8% |
| e | 416440 | 7.6% |
| t | 377433 | 6.9% |
| r | 296405 | 5.4% |
| (space) | 291839 | 5.3% |
| i | 275803 | 5.0% |
| u | 235367 | 4.3% |
| l | 211481 | 3.9% |
| Other values (117) | 1992230 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 4288557 | |
| Uppercase Letter | 816059 | 14.9% |
| Space Separator | 291839 | 5.3% |
| Open Punctuation | 35866 | 0.7% |
| Close Punctuation | 35855 | 0.7% |
| Other Punctuation | 13360 | 0.2% |
| Dash Punctuation | 11015 | 0.2% |
| Decimal Number | 42 | < 0.1% |
| Modifier Symbol | 34 | < 0.1% |
| Math Symbol | 19 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 521113 | |
| o | 447668 | |
| n | 426872 | |
| e | 416440 | |
| t | 377433 | |
| r | 296405 | 6.9% |
| i | 275803 | 6.4% |
| u | 235367 | 5.5% |
| l | 211481 | 4.9% |
| s | 188065 | 4.4% |
| Other values (50) | 891910 |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 159354 | |
| S | 97656 | |
| M | 66313 | 8.1% |
| N | 51844 | 6.4% |
| B | 47582 | 5.8% |
| P | 47574 | 5.8% |
| A | 39007 | 4.8% |
| L | 32820 | 4.0% |
| H | 32647 | 4.0% |
| D | 32163 | 3.9% |
| Other values (30) | 209099 |
Other Punctuation
| Value | Count | Frequency (%) |
| ' | 7093 | |
| . | 4324 | |
| / | 1402 | 10.5% |
| ? | 289 | 2.2% |
| , | 210 | 1.6% |
| * | 25 | 0.2% |
| & | 12 | 0.1% |
| ¡ | 2 | < 0.1% |
| ; | 1 | < 0.1% |
| \ | 1 | < 0.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 17 | |
| 2 | 15 | |
| 0 | 7 | |
| 4 | 2 | 4.8% |
| 6 | 1 | 2.4% |
Open Punctuation
| Value | Count | Frequency (%) |
| [ | 33508 | |
| ( | 2358 | 6.6% |
Close Punctuation
| Value | Count | Frequency (%) |
| ] | 33498 | |
| ) | 2357 | 6.6% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 10890 | |
| – | 125 | 1.1% |
Math Symbol
| Value | Count | Frequency (%) |
| = | 18 | |
| + | 1 | 5.3% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 291839 | 100.0% |
Modifier Symbol
| Value | Count | Frequency (%) |
| ´ | 34 | 100.0% |
Modifier Letter
| Value | Count | Frequency (%) |
| ʻ | 5 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 5104616 | |
| Common | 388035 | 7.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 521113 | 10.2% |
| o | 447668 | 8.8% |
| n | 426872 | 8.4% |
| e | 416440 | 8.2% |
| t | 377433 | 7.4% |
| r | 296405 | 5.8% |
| i | 275803 | 5.4% |
| u | 235367 | 4.6% |
| l | 211481 | 4.1% |
| s | 188065 | 3.7% |
| Other values (90) | 1707969 |
Common
| Value | Count | Frequency (%) |
| (space) | 291839 | |
| [ | 33508 | 8.6% |
| ] | 33498 | 8.6% |
| - | 10890 | 2.8% |
| ' | 7093 | 1.8% |
| . | 4324 | 1.1% |
| ( | 2358 | 0.6% |
| ) | 2357 | 0.6% |
| / | 1402 | 0.4% |
| ? | 289 | 0.1% |
| Other values (17) | 477 | 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 5474783 | |
| None | 17734 | 0.3% |
| Punctuation | 125 | < 0.1% |
| Modifier Letters | 5 | < 0.1% |
| Latin Ext Additional | 4 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 521113 | 9.5% |
| o | 447668 | 8.2% |
| n | 426872 | 7.8% |
| e | 416440 | 7.6% |
| t | 377433 | 6.9% |
| r | 296405 | 5.4% |
| (space) | 291839 | 5.3% |
| i | 275803 | 5.0% |
| u | 235367 | 4.3% |
| l | 211481 | 3.9% |
| Other values (65) | 1974362 |
None
| Value | Count | Frequency (%) |
| á | 3810 | |
| é | 3219 | |
| í | 2995 | |
| ó | 2587 | |
| ã | 1808 | |
| ç | 1124 | 6.3% |
| ô | 367 | 2.1% |
| è | 364 | 2.1% |
| ñ | 315 | 1.8% |
| ü | 299 | 1.7% |
| Other values (38) | 846 | 4.8% |
Punctuation
| Value | Count | Frequency (%) |
| – | 125 | 100.0% |
Modifier Letters
| Value | Count | Frequency (%) |
| ʻ | 5 | 100.0% |
Latin Ext Additional
| Value | Count | Frequency (%) |
| ộ | 3 | |
| ắ | 1 | 25.0% |
municipality
Text
Constant  Missing 
| Distinct | 1 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 2361472 |
| Missing (%) | > 99.9% |
| Memory size | 18.0 MiB |
Length
| Max length | 6 |
|---|---|
| Median length | 6 |
| Mean length | 6 |
| Min length | 6 |
Unique
| Unique | 1 |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | -53.33 |
|---|---|
| Value | Count | Frequency (%) |
| 53.33 | 1 | 100.0% |
Most occurring characters
| Value | Count | Frequency (%) |
| 3 | 3 | |
| - | 1 | 16.7% |
| 5 | 1 | 16.7% |
| . | 1 | 16.7% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 4 | |
| Dash Punctuation | 1 | 16.7% |
| Other Punctuation | 1 | 16.7% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 3 | 3 | |
| 5 | 1 | 25.0% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 1 | 100.0% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 1 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 6 | 100.0% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 3 | 3 | |
| - | 1 | 16.7% |
| 5 | 1 | 16.7% |
| . | 1 | 16.7% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 6 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 3 | 3 | |
| - | 1 | 16.7% |
| 5 | 1 | 16.7% |
| . | 1 | 16.7% |
locality
Text
Missing 
| Distinct | 924366 |
|---|---|
| Distinct (%) | 45.7% |
| Missing | 337166 |
| Missing (%) | 14.3% |
| Memory size | 18.0 MiB |
Length
| Max length | 220527 |
|---|---|
| Median length | 381 |
| Mean length | 40.58323318 |
| Min length | 1 |
Unique
| Unique | 736013 |
|---|---|
| Unique (%) | 36.4% |
Sample
| 1st row | Carrie Bow Cay, Spur And Groove Zone |
|---|---|
| 2nd row | Eastern edge of Nashville, Davidson County. |
| 3rd row | Monongahela National Forest, 1.2-1.4 mi (by road) W of Bear Heaven Campground, on road to Bickle Knob |
| 4th row | Hales Landing, Flint River about 7 miles below Bainbridge, basal Chattahoochee Formation, Oligocene, Vicksburgian |
| 5th row | Orono |
| Value | Count | Frequency (%) |
| of | 676638 | 5.1% |
| de | 173473 | 1.3% |
| island | 171486 | 1.3% |
| km | 144885 | 1.1% |
| on | 127233 | 1.0% |
| near | 121493 | 0.9% |
| the | 114523 | 0.9% |
| road | 113788 | 0.9% |
| mi | 107866 | 0.8% |
| and | 105679 | 0.8% |
| Other values (335687) | 11406497 |
Most occurring characters
| Value | Count | Frequency (%) |
| (space) | 11193478 | 13.6% |
| a | 7469383 | 9.1% |
| e | 5570223 | 6.8% |
| o | 5407301 | 6.6% |
| n | 4552644 | 5.5% |
| i | 4136655 | 5.0% |
| r | 3972806 | 4.8% |
| t | 3638627 | 4.4% |
| l | 2985176 | 3.6% |
| s | 2875069 | 3.5% |
| Other values (317) | 30351561 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 55766334 | |
| Space Separator | 11193478 | 13.6% |
| Uppercase Letter | 9422464 | 11.5% |
| Other Punctuation | 3689608 | 4.5% |
| Decimal Number | 1341617 | 1.6% |
| Open Punctuation | 187756 | 0.2% |
| Close Punctuation | 187019 | 0.2% |
| Dash Punctuation | 184291 | 0.2% |
| Control | 148080 | 0.2% |
| Math Symbol | 15175 | < 0.1% |
| Other values (11) | 17101 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 7469383 | |
| e | 5570223 | |
| o | 5407301 | |
| n | 4552644 | 8.2% |
| i | 4136655 | 7.4% |
| r | 3972806 | 7.1% |
| t | 3638627 | 6.5% |
| l | 2985176 | 5.4% |
| s | 2875069 | 5.2% |
| u | 2055702 | 3.7% |
| Other values (132) | 13102748 |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 961889 | 10.2% |
| C | 957641 | 10.2% |
| M | 661223 | 7.0% |
| P | 637053 | 6.8% |
| R | 626599 | 6.7% |
| B | 575723 | 6.1% |
| N | 541498 | 5.7% |
| A | 453780 | 4.8% |
| I | 410717 | 4.4% |
| L | 409160 | 4.3% |
| Other values (70) | 3187181 |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 1706617 | |
| . | 1642092 | |
| : | 131554 | 3.6% |
| ; | 81659 | 2.2% |
| ' | 58445 | 1.6% |
| " | 28934 | 0.8% |
| / | 20650 | 0.6% |
| & | 11584 | 0.3% |
| # | 3572 | 0.1% |
| ? | 3483 | 0.1% |
| Other values (9) | 1018 | < 0.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 262130 | |
| 2 | 192764 | |
| 0 | 182124 | |
| 5 | 152353 | |
| 3 | 133850 | |
| 4 | 110344 | |
| 6 | 96303 | 7.2% |
| 7 | 74994 | 5.6% |
| 8 | 73246 | 5.5% |
| 9 | 63509 | 4.7% |
Control
| Value | Count | Frequency (%) |
| | 147366 | |
| | 665 | 0.4% |
| | 16 | < 0.1% |
| | 11 | < 0.1% |
| | 9 | < 0.1% |
| | 8 | < 0.1% |
| | 2 | < 0.1% |
| | 2 | < 0.1% |
| | 1 | < 0.1% |
Math Symbol
| Value | Count | Frequency (%) |
| = | 10268 | |
| + | 2352 | 15.5% |
| ± | 1347 | 8.9% |
| ~ | 419 | 2.8% |
| > | 417 | 2.7% |
| < | 329 | 2.2% |
| | | 35 | 0.2% |
| → | 5 | < 0.1% |
| ∆ | 3 | < 0.1% |
Other Number
| Value | Count | Frequency (%) |
| ½ | 3259 | |
| ¼ | 1386 | |
| ¾ | 192 | 3.9% |
| ² | 22 | 0.5% |
| ⅓ | 18 | 0.4% |
| ⅛ | 2 | < 0.1% |
| ³ | 2 | < 0.1% |
| ⅜ | 1 | < 0.1% |
Other Symbol
| Value | Count | Frequency (%) |
| ° | 2020 | |
| ├ | 6 | 0.3% |
| ░ | 3 | 0.1% |
| ┬ | 3 | 0.1% |
| ▒ | 1 | < 0.1% |
| © | 1 | < 0.1% |
Format
| Value | Count | Frequency (%) |
| | 28 | |
| | 2 | 5.7% |
| | 2 | 5.7% |
| | 1 | 2.9% |
| | 1 | 2.9% |
| | 1 | 2.9% |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 141352 | |
| [ | 46198 | 24.6% |
| „ | 99 | 0.1% |
| { | 54 | < 0.1% |
| ‚ | 53 | < 0.1% |
Modifier Letter
| Value | Count | Frequency (%) |
| ʻ | 122 | |
| ᵉ | 2 | 1.6% |
| ᵍ | 1 | 0.8% |
| ᴱ | 1 | 0.8% |
| ᴸ | 1 | 0.8% |
Currency Symbol
| Value | Count | Frequency (%) |
| ¢ | 39 | |
| ¤ | 17 | |
| £ | 3 | 4.8% |
| $ | 3 | 4.8% |
| ¥ | 1 | 1.6% |
Nonspacing Mark
| Value | Count | Frequency (%) |
| ̄ | 2 | |
| ̈ | 2 | |
| ᷉ | 1 | |
| ̌ | 1 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 184274 | |
| – | 11 | < 0.1% |
| — | 6 | < 0.1% |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 140703 | |
| ] | 46254 | 24.7% |
| } | 62 | < 0.1% |
Final Punctuation
| Value | Count | Frequency (%) |
| » | 205 | |
| ” | 15 | 6.6% |
| › | 6 | 2.7% |
Initial Punctuation
| Value | Count | Frequency (%) |
| « | 201 | |
| “ | 37 | 15.5% |
| ‛ | 1 | 0.4% |
Modifier Symbol
| Value | Count | Frequency (%) |
| ´ | 136 | |
| ¨ | 9 | 6.0% |
| ^ | 5 | 3.3% |
Other Letter
| Value | Count | Frequency (%) |
| º | 859 | |
| ª | 17 | 1.9% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 11193478 | 100.0% |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 8463 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 65189652 | |
| Common | 16963237 | 20.6% |
| Greek | 27 | < 0.1% |
| Inherited | 7 | < 0.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 7469383 | 11.5% |
| e | 5570223 | 8.5% |
| o | 5407301 | 8.3% |
| n | 4552644 | 7.0% |
| i | 4136655 | 6.3% |
| r | 3972806 | 6.1% |
| t | 3638627 | 5.6% |
| l | 2985176 | 4.6% |
| s | 2875069 | 4.4% |
| u | 2055702 | 3.2% |
| Other values (208) | 22526066 |
Common
| Value | Count | Frequency (%) |
| (space) | 11193478 | |
| , | 1706617 | 10.1% |
| . | 1642092 | 9.7% |
| 1 | 262130 | 1.5% |
| 2 | 192764 | 1.1% |
| - | 184274 | 1.1% |
| 0 | 182124 | 1.1% |
| 5 | 152353 | 0.9% |
| | 147366 | 0.9% |
| ( | 141352 | 0.8% |
| Other values (84) | 1158687 | 6.8% |
Greek
| Value | Count | Frequency (%) |
| λ | 6 | |
| ν | 5 | |
| Κ | 3 | |
| υ | 3 | |
| ή | 3 | |
| η | 3 | |
| ω | 1 | 3.7% |
| ρ | 1 | 3.7% |
| Π | 1 | 3.7% |
| ά | 1 | 3.7% |
Inherited
| Value | Count | Frequency (%) |
| ̄ | 2 | |
| ̈ | 2 | |
| | 1 | |
| ᷉ | 1 | |
| ̌ | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 81912693 | |
| None | 239779 | 0.3% |
| Punctuation | 266 | < 0.1% |
| Modifier Letters | 122 | < 0.1% |
| Number Forms | 21 | < 0.1% |
| Box Drawing | 9 | < 0.1% |
| Latin Ext Additional | 8 | < 0.1% |
| Arrows | 5 | < 0.1% |
| Diacriticals | 5 | < 0.1% |
| Phonetic Ext | 5 | < 0.1% |
| Other values (4) | 10 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| (space) | 11193478 | 13.7% |
| a | 7469383 | 9.1% |
| e | 5570223 | 6.8% |
| o | 5407301 | 6.6% |
| n | 4552644 | 5.6% |
| i | 4136655 | 5.1% |
| r | 3972806 | 4.9% |
| t | 3638627 | 4.4% |
| l | 2985176 | 3.6% |
| s | 2875069 | 3.5% |
| Other values (88) | 30111331 |
None
| Value | Count | Frequency (%) |
| í | 59545 | |
| á | 42977 | |
| é | 29020 | |
| ó | 24065 | |
| ñ | 12011 | 5.0% |
| ã | 9499 | 4.0% |
| ú | 6798 | 2.8% |
| ç | 5940 | 2.5% |
| ü | 4680 | 2.0% |
| ä | 4359 | 1.8% |
| Other values (178) | 40885 |
Modifier Letters
| Value | Count | Frequency (%) |
| ʻ | 122 | 100.0% |
Punctuation
| Value | Count | Frequency (%) |
| „ | 99 | |
| ‚ | 53 | |
| “ | 37 | 13.9% |
| … | 30 | 11.3% |
| ” | 15 | 5.6% |
| – | 11 | 4.1% |
| — | 6 | 2.3% |
| › | 6 | 2.3% |
| | 2 | 0.8% |
| | 2 | 0.8% |
| Other values (5) | 5 | 1.9% |
Number Forms
| Value | Count | Frequency (%) |
| ⅓ | 18 | |
| ⅛ | 2 | 9.5% |
| ⅜ | 1 | 4.8% |
Box Drawing
| Value | Count | Frequency (%) |
| ├ | 6 | |
| ┬ | 3 |
Arrows
| Value | Count | Frequency (%) |
| → | 5 | 100.0% |
Block Elements
| Value | Count | Frequency (%) |
| ░ | 3 | |
| ▒ | 1 | 25.0% |
Math Operators
| Value | Count | Frequency (%) |
| ∆ | 3 |
IPA Ext
| Value | Count | Frequency (%) |
| ɶ | 2 |
Diacriticals
| Value | Count | Frequency (%) |
| ̄ | 2 | |
| ̈ | 2 | |
| ̌ | 1 |
Phonetic Ext
| Value | Count | Frequency (%) |
| ᵉ | 2 | |
| ᵍ | 1 | |
| ᴱ | 1 | |
| ᴸ | 1 |
Latin Ext Additional
| Value | Count | Frequency (%) |
| ḿ | 2 | |
| ộ | 1 | |
| ấ | 1 | |
| ế | 1 | |
| ṁ | 1 | |
| ắ | 1 | |
| ḗ | 1 |
Diacriticals Sup
| Value | Count | Frequency (%) |
| ᷉ | 1 |
Missing 
| Distinct | 2886 |
|---|---|
| Distinct (%) | 4.2% |
| Missing | 2293088 |
| Missing (%) | 97.1% |
| Memory size | 18.0 MiB |
Length
| Max length | 152 |
|---|---|
| Median length | 124 |
| Mean length | 7.501996052 |
| Min length | 1 |
Unique
| Unique | 752 |
|---|---|
| Unique (%) | 1.1% |
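The percentages in this block appear to use two different denominators: Missing (%) is taken over all observations, while Distinct (%) and Unique (%) are taken over non-missing values only. A minimal Python sketch that checks this reading against the counts above, using the 2361473 observations from the report overview. The function name is illustrative and not part of the profiling tool.

```python
def profile_percentages(n_total, n_missing, n_distinct, n_unique):
    """Recompute the summary percentages from raw counts (illustrative helper)."""
    # Distinct and unique shares are assumed to be relative to non-missing rows.
    n_present = n_total - n_missing
    return {
        "missing_pct": round(100 * n_missing / n_total, 1),
        "distinct_pct": round(100 * n_distinct / n_present, 1),
        "unique_pct": round(100 * n_unique / n_present, 1),
    }


stats = profile_percentages(2361473, 2293088, 2886, 752)
print(stats)
```

Under that assumption the recomputed values agree with the 97.1%, 4.2%, and 1.1% reported above.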
Sample
| 1st row | 3600 (3440-3760) ft |
|---|---|
| 2nd row | ~1800 ft. |
| 3rd row | 80 ft |
| 4th row | 160 m |
| 5th row | 150 m |
| Value | Count | Frequency (%) |
| ft | 49451 | |
| m | 16017 | 11.1% |
| ca | 3567 | 2.5% |
| feet | 1112 | 0.8% |
| 200 | 1103 | 0.8% |
| 1100-1350 | 1002 | 0.7% |
| 10 | 898 | 0.6% |
| 20 | 771 | 0.5% |
| 3400 | 723 | 0.5% |
| 3500 | 707 | 0.5% |
| Other values (1929) | 69115 |
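The sample rows and token counts above mix feet and meters, with ranges and approximation markers. A minimal Python sketch of one way such strings could be normalized to meters. The pattern, the function name, and the choice to take the first number or range and ignore parenthesized alternatives are assumptions for illustration, not how the dataset publisher processes this field.

```python
import re

FEET_TO_METERS = 0.3048

# First number, optionally a range "a-b", then the nearest unit token.
_PATTERN = re.compile(
    r"(?P<low>\d+(?:\.\d+)?)\s*(?:-\s*(?P<high>\d+(?:\.\d+)?))?"
    r".*?\b(?P<unit>ft|feet|m)\b",
    re.IGNORECASE,
)


def parse_elevation(text):
    """Return (min_m, max_m) in meters, or None if the string is not understood."""
    match = _PATTERN.search(text)
    if match is None:
        return None
    low = float(match.group("low"))
    high = float(match.group("high")) if match.group("high") else low
    factor = 1.0 if match.group("unit").lower() == "m" else FEET_TO_METERS
    return (round(low * factor, 1), round(high * factor, 1))


for raw in ["3600 (3440-3760) ft", "~1800 ft.", "80 ft", "160 m", "1100-1350 m"]:
    print(raw, "->", parse_elevation(raw))
```

Strings with no recognizable unit return None rather than a guess, so they can be reviewed by hand.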
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 102042 | |
| (space) | 76081 | |
| t | 52717 | |
| f | 51306 | |
| 1 | 26577 | 5.2% |
| 3 | 25497 | 5.0% |
| 2 | 24429 | 4.8% |
| 4 | 22152 | 4.3% |
| 5 | 20591 | 4.0% |
| m | 17094 | 3.3% |
| Other values (67) | 94538 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 266424 | |
| Lowercase Letter | 154716 | |
| Space Separator | 76081 | 14.8% |
| Dash Punctuation | 8084 | 1.6% |
| Other Punctuation | 5435 | 1.1% |
| Uppercase Letter | 1378 | 0.3% |
| Open Punctuation | 398 | 0.1% |
| Close Punctuation | 398 | 0.1% |
| Math Symbol | 110 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| t | 52717 | |
| f | 51306 | |
| m | 17094 | 11.0% |
| e | 7739 | 5.0% |
| a | 6180 | 4.0% |
| c | 4297 | 2.8% |
| s | 2575 | 1.7% |
| l | 2322 | 1.5% |
| o | 2060 | 1.3% |
| r | 1687 | 1.1% |
| Other values (15) | 6739 | 4.4% |
Uppercase Letter
| Value | Count | Frequency (%) |
| D | 402 | |
| T | 181 | |
| P | 145 | 10.5% |
| W | 140 | 10.2% |
| A | 108 | 7.8% |
| R | 107 | 7.8% |
| C | 61 | 4.4% |
| N | 40 | 2.9% |
| G | 32 | 2.3% |
| S | 21 | 1.5% |
| Other values (12) | 141 | 10.2% |
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 102042 | |
| 1 | 26577 | 10.0% |
| 3 | 25497 | 9.6% |
| 2 | 24429 | 9.2% |
| 4 | 22152 | 8.3% |
| 5 | 20591 | 7.7% |
| 6 | 15483 | 5.8% |
| 8 | 12050 | 4.5% |
| 7 | 10078 | 3.8% |
| 9 | 7525 | 2.8% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 4628 | |
| : | 402 | 7.4% |
| ' | 167 | 3.1% |
| , | 153 | 2.8% |
| " | 32 | 0.6% |
| ? | 28 | 0.5% |
| ; | 18 | 0.3% |
| / | 4 | 0.1% |
| & | 3 | 0.1% |
Math Symbol
| Value | Count | Frequency (%) |
| < | 54 | |
| + | 19 | 17.3% |
| = | 18 | 16.4% |
| > | 10 | 9.1% |
| ~ | 9 | 8.2% |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 361 | |
| [ | 37 | 9.3% |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 361 | |
| ] | 37 | 9.3% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 76081 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 8084 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 356930 | |
| Latin | 156094 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| t | 52717 | |
| f | 51306 | |
| m | 17094 | 11.0% |
| e | 7739 | 5.0% |
| a | 6180 | 4.0% |
| c | 4297 | 2.8% |
| s | 2575 | 1.6% |
| l | 2322 | 1.5% |
| o | 2060 | 1.3% |
| r | 1687 | 1.1% |
| Other values (37) | 8117 | 5.2% |
Common
| Value | Count | Frequency (%) |
| 0 | 102042 | |
| (space) | 76081 | |
| 1 | 26577 | 7.4% |
| 3 | 25497 | 7.1% |
| 2 | 24429 | 6.8% |
| 4 | 22152 | 6.2% |
| 5 | 20591 | 5.8% |
| 6 | 15483 | 4.3% |
| 8 | 12050 | 3.4% |
| 7 | 10078 | 2.8% |
| Other values (20) | 21950 | 6.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 513024 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 102042 | |
| (space) | 76081 | |
| t | 52717 | |
| f | 51306 | |
| 1 | 26577 | 5.2% |
| 3 | 25497 | 5.0% |
| 2 | 24429 | 4.8% |
| 4 | 22152 | 4.3% |
| 5 | 20591 | 4.0% |
| m | 17094 | 3.3% |
| Other values (67) | 94538 |
verbatimDepth
Text
Missing 
| Distinct | 853 |
|---|---|
| Distinct (%) | 5.9% |
| Missing | 2347005 |
| Missing (%) | 99.4% |
| Memory size | 18.0 MiB |
Length
| Max length | 232132 |
|---|---|
| Median length | 91 |
| Mean length | 24.7726016 |
| Min length | 1 |
Unique
| Unique | 441 |
|---|---|
| Unique (%) | 3.0% |
Sample
| 1st row | Littoral |
|---|---|
| 2nd row | 00000000, 00000013 |
| 3rd row | penetration depth: 15cm |
| 4th row | 1 ms ca. |
| 5th row | Intertidal |
| Value | Count | Frequency (%) |
| ca | 6547 | 15.4% |
| intertidal | 3133 | 7.4% |
| surface | 1656 | 3.9% |
| recorded | 744 | 1.7% |
| depths | 742 | 1.7% |
| multiple | 737 | 1.7% |
| false | 567 | 1.3% |
| depth | 504 | 1.2% |
| us | 503 | 1.2% |
| 1 | 349 | 0.8% |
| Other values (5512) | 27047 |
Most occurring characters
| Value | Count | Frequency (%) |
| 41247 | 11.5% | |
| a | 23342 | 6.5% |
| e | 18168 | 5.1% |
| t | 16885 | 4.7% |
| (space) | 15290 | 4.3% |
| r | 12513 | 3.5% |
| c | 12484 | 3.5% |
| i | 12429 | 3.5% |
| 0 | 11460 | 3.2% |
| l | 11180 | 3.1% |
| Other values (93) | 183412 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 170824 | |
| Decimal Number | 57424 | 16.0% |
| Uppercase Letter | 47386 | 13.2% |
| Control | 41433 | 11.6% |
| Other Punctuation | 17500 | 4.9% |
| Space Separator | 15290 | 4.3% |
| Dash Punctuation | 5657 | 1.6% |
| Connector Punctuation | 2318 | 0.6% |
| Open Punctuation | 257 | 0.1% |
| Close Punctuation | 254 | 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 23342 | |
| e | 18168 | |
| t | 16885 | |
| r | 12513 | 7.3% |
| c | 12484 | 7.3% |
| i | 12429 | 7.3% |
| l | 11180 | 6.5% |
| d | 9861 | 5.8% |
| n | 9804 | 5.7% |
| o | 8826 | 5.2% |
| Other values (31) | 35332 |
Uppercase Letter
| Value | Count | Frequency (%) |
| I | 5324 | |
| S | 5076 | |
| E | 4373 | |
| C | 4137 | 8.7% |
| A | 3839 | 8.1% |
| N | 3365 | 7.1% |
| M | 3220 | 6.8% |
| T | 2786 | 5.9% |
| R | 2663 | 5.6% |
| U | 1879 | 4.0% |
| Other values (17) | 10724 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 8114 | |
| : | 4020 | |
| , | 3576 | |
| / | 997 | 5.7% |
| ; | 277 | 1.6% |
| " | 233 | 1.3% |
| ' | 179 | 1.0% |
| & | 85 | 0.5% |
| @ | 10 | 0.1% |
| ? | 5 | < 0.1% |
| Other values (2) | 4 | < 0.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 11460 | |
| 1 | 8175 | |
| 2 | 7952 | |
| 3 | 5184 | |
| 4 | 4941 | |
| 5 | 4328 | 7.5% |
| 8 | 4216 | 7.3% |
| 6 | 3855 | 6.7% |
| 7 | 3699 | 6.4% |
| 9 | 3614 | 6.3% |
Math Symbol
| Value | Count | Frequency (%) |
| < | 34 | |
| = | 17 | |
| + | 10 | 14.9% |
| ~ | 6 | 9.0% |
Control
| Value | Count | Frequency (%) |
| 41247 | ||
| 186 | 0.4% |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 252 | |
| [ | 5 | 1.9% |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 249 | |
| ] | 5 | 2.0% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 15290 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 5657 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 2318 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 218210 | |
| Common | 140200 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 23342 | 10.7% |
| e | 18168 | 8.3% |
| t | 16885 | 7.7% |
| r | 12513 | 5.7% |
| c | 12484 | 5.7% |
| i | 12429 | 5.7% |
| l | 11180 | 5.1% |
| d | 9861 | 4.5% |
| n | 9804 | 4.5% |
| o | 8826 | 4.0% |
| Other values (58) | 82718 |
Common
| Value | Count | Frequency (%) |
| 41247 | ||
| (space) | 15290 | 10.9% |
| 0 | 11460 | 8.2% |
| 1 | 8175 | 5.8% |
| . | 8114 | 5.8% |
| 2 | 7952 | 5.7% |
| - | 5657 | 4.0% |
| 3 | 5184 | 3.7% |
| 4 | 4941 | 3.5% |
| 5 | 4328 | 3.1% |
| Other values (25) | 27852 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 358321 | |
| None | 89 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 41247 | 11.5% | |
| a | 23342 | 6.5% |
| e | 18168 | 5.1% |
| t | 16885 | 4.7% |
| (space) | 15290 | 4.3% |
| r | 12513 | 3.5% |
| c | 12484 | 3.5% |
| i | 12429 | 3.5% |
| 0 | 11460 | 3.2% |
| l | 11180 | 3.1% |
| Other values (77) | 183323 |
None
| Value | Count | Frequency (%) |
| é | 16 | |
| í | 14 | |
| ó | 10 | |
| á | 10 | |
| ü | 8 | |
| ô | 6 | 6.7% |
| ö | 5 | 5.6% |
| ã | 4 | 4.5% |
| ñ | 3 | 3.4% |
| ä | 3 | 3.4% |
| Other values (6) | 10 |
decimalLatitude
Text
Missing 
| Distinct | 97854 |
|---|---|
| Distinct (%) | 13.7% |
| Missing | 1649765 |
| Missing (%) | 69.9% |
| Memory size | 18.0 MiB |
Length
| Max length | 84 |
|---|---|
| Median length | 11 |
| Mean length | 6.194806016 |
| Min length | 3 |
Unique
| Unique | 44295 |
|---|---|
| Unique (%) | 6.2% |
Sample
| 1st row | 16.8033 |
|---|---|
| 2nd row | 38.9361 |
| 3rd row | 29.2483 |
| 4th row | 44.8831 |
| 5th row | 29.2586 |
| Value | Count | Frequency (%) |
| 25.58 | 2629 | 0.4% |
| 40.6583 | 2215 | 0.3% |
| 26.17 | 1853 | 0.3% |
| 26.5 | 1352 | 0.2% |
| 39.6891 | 1261 | 0.2% |
| 38.9694 | 1127 | 0.2% |
| 39.6306 | 1069 | 0.2% |
| 26.97 | 1018 | 0.1% |
| 38.895 | 1015 | 0.1% |
| 60.75 | 991 | 0.1% |
| Other values (91110) | 697189 |
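This column is profiled as Text, and the character tables below report a small number of letters, commas, and spaces among several million digits, so some values are not plain decimal numbers. A minimal Python sketch of a range check that could separate usable latitudes from the rest. The function name is illustrative, and the accepted range of -90 to 90 degrees is the usual convention for decimal latitude rather than something stated in this report.

```python
def to_latitude(value):
    """Return a float in [-90, 90], or None for anything else (illustrative)."""
    try:
        number = float(value.strip())
    except (AttributeError, ValueError):
        # Non-strings (such as None) and non-numeric text both end up here.
        return None
    # NaN fails both comparisons, so it is rejected here as well.
    return number if -90.0 <= number <= 90.0 else None


for raw in ["16.8033", " -53.33 ", "Baffin Island", "123.4"]:
    print(repr(raw), "->", to_latitude(raw))
```

The same check with bounds of -180 and 180 would apply to a longitude column.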
Most occurring characters
| Value | Count | Frequency (%) |
| . | 711708 | |
| 3 | 572272 | |
| 2 | 393297 | |
| 1 | 382548 | |
| 5 | 353375 | |
| 8 | 347442 | |
| 7 | 340351 | |
| 4 | 332050 | |
| 6 | 327777 | |
| 9 | 278204 | 6.3% |
| Other values (27) | 369869 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 3558864 | |
| Other Punctuation | 711712 | 16.1% |
| Dash Punctuation | 138221 | 3.1% |
| Lowercase Letter | 57 | < 0.1% |
| Uppercase Letter | 28 | < 0.1% |
| Space Separator | 11 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 8 | |
| s | 7 | |
| a | 6 | |
| t | 5 | |
| n | 5 | |
| i | 5 | |
| d | 4 | |
| r | 4 | |
| u | 3 | 5.3% |
| l | 2 | 3.5% |
| Other values (6) | 8 |
Decimal Number
| Value | Count | Frequency (%) |
| 3 | 572272 | |
| 2 | 393297 | |
| 1 | 382548 | |
| 5 | 353375 | |
| 8 | 347442 | |
| 7 | 340351 | |
| 4 | 332050 | |
| 6 | 327777 | |
| 9 | 278204 | |
| 0 | 231548 |
Uppercase Letter
| Value | Count | Frequency (%) |
| E | 18 | |
| A | 3 | 10.7% |
| L | 2 | 7.1% |
| I | 2 | 7.1% |
| B | 1 | 3.6% |
| N | 1 | 3.6% |
| W | 1 | 3.6% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 711708 | |
| , | 4 | < 0.1% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 138221 |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 11 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 4408808 | |
| Latin | 85 | < 0.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| E | 18 | |
| e | 8 | 9.4% |
| s | 7 | 8.2% |
| a | 6 | 7.1% |
| t | 5 | 5.9% |
| n | 5 | 5.9% |
| i | 5 | 5.9% |
| d | 4 | 4.7% |
| r | 4 | 4.7% |
| A | 3 | 3.5% |
| Other values (13) | 20 |
Common
| Value | Count | Frequency (%) |
| . | 711708 | |
| 3 | 572272 | |
| 2 | 393297 | |
| 1 | 382548 | |
| 5 | 353375 | |
| 8 | 347442 | |
| 7 | 340351 | |
| 4 | 332050 | |
| 6 | 327777 | |
| 9 | 278204 | 6.3% |
| Other values (4) | 369784 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 4408893 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| . | 711708 | |
| 3 | 572272 | |
| 2 | 393297 | |
| 1 | 382548 | |
| 5 | 353375 | |
| 8 | 347442 | |
| 7 | 340351 | |
| 4 | 332050 | |
| 6 | 327777 | |
| 9 | 278204 | 6.3% |
| Other values (27) | 369869 |
decimalLongitude
Text
Missing 
| Distinct | 102754 |
|---|---|
| Distinct (%) | 14.4% |
| Missing | 1649765 |
| Missing (%) | 69.9% |
| Memory size | 18.0 MiB |
Length
| Max length | 13 |
|---|---|
| Median length | 12 |
| Mean length | 7.06795343 |
| Min length | 3 |
Unique
| Unique | 44520 |
|---|---|
| Unique (%) | 6.3% |
Sample
| 1st row | -88.0767 |
|---|---|
| 2nd row | -79.6908 |
| 3rd row | -88.1214 |
| 4th row | -68.672 |
| 5th row | -94.9533 |
| Value | Count | Frequency (%) |
| 80.1 | 2655 | 0.4% |
| 105.644 | 1281 | 0.2% |
| 127.848 | 1115 | 0.2% |
| 88.08 | 1095 | 0.2% |
| 77.4714 | 1069 | 0.2% |
| 67.7683 | 1046 | 0.1% |
| 139.5 | 995 | 0.1% |
| 77.0367 | 986 | 0.1% |
| 80.13 | 980 | 0.1% |
| 77.1767 | 933 | 0.1% |
| Other values (95547) | 699553 |
Most occurring characters
| Value | Count | Frequency (%) |
| . | 711707 | |
| - | 580730 | |
| 7 | 529418 | |
| 1 | 483727 | |
| 8 | 448602 | |
| 6 | 391174 | |
| 3 | 376072 | |
| 5 | 343881 | |
| 2 | 329551 | |
| 9 | 300995 | |
| Other values (13) | 534462 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 3737869 | |
| Other Punctuation | 711707 | 14.1% |
| Dash Punctuation | 580730 | 11.5% |
| Uppercase Letter | 12 | < 0.1% |
| Connector Punctuation | 1 | < 0.1% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 7 | 529418 | |
| 1 | 483727 | |
| 8 | 448602 | |
| 6 | 391174 | |
| 3 | 376072 | |
| 5 | 343881 | |
| 2 | 329551 | |
| 9 | 300995 | |
| 4 | 273631 | |
| 0 | 260818 |
Uppercase Letter
| Value | Count | Frequency (%) |
| R | 2 | |
| A | 2 | |
| N | 1 | |
| O | 1 | |
| T | 1 | |
| H | 1 | |
| M | 1 | |
| E | 1 | |
| I | 1 | |
| C | 1 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 711707 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 580730 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 5030307 | |
| Latin | 12 | < 0.1% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| . | 711707 | |
| - | 580730 | |
| 7 | 529418 | |
| 1 | 483727 | |
| 8 | 448602 | |
| 6 | 391174 | |
| 3 | 376072 | |
| 5 | 343881 | |
| 2 | 329551 | |
| 9 | 300995 | |
| Other values (3) | 534450 |
Latin
| Value | Count | Frequency (%) |
| R | 2 | |
| A | 2 | |
| N | 1 | |
| O | 1 | |
| T | 1 | |
| H | 1 | |
| M | 1 | |
| E | 1 | |
| I | 1 | |
| C | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 5030319 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| . | 711707 | |
| - | 580730 | |
| 7 | 529418 | |
| 1 | 483727 | |
| 8 | 448602 | |
| 6 | 391174 | |
| 3 | 376072 | |
| 5 | 343881 | |
| 2 | 329551 | |
| 9 | 300995 | |
| Other values (13) | 534462 |
coordinateUncertaintyInMeters
Text
Missing 
| Distinct | 5438 |
|---|---|
| Distinct (%) | 12.6% |
| Missing | 2318351 |
| Missing (%) | 98.2% |
| Memory size | 18.0 MiB |
Length
| Max length | 9 |
|---|---|
| Median length | 8 |
| Mean length | 6.314085618 |
| Min length | 3 |
Unique
| Unique | 2012 |
|---|---|
| Unique (%) | 4.7% |
Sample
| 1st row | 401.57 |
|---|---|
| 2nd row | 3246.0 |
| 3rd row | 3429.51 |
| 4th row | 801.57 |
| 5th row | 4233.0 |
| Value | Count | Frequency (%) |
| 3036.0 | 447 | 1.0% |
| 100.0 | 377 | 0.9% |
| 347.62 | 374 | 0.9% |
| 500.0 | 363 | 0.8% |
| 16000.0 | 330 | 0.8% |
| 186.68 | 323 | 0.7% |
| 1000.0 | 321 | 0.7% |
| 4615.0 | 287 | 0.7% |
| 1066.0 | 266 | 0.6% |
| 5615.0 | 259 | 0.6% |
| Other values (5428) | 39775 |
Most occurring characters
| Value | Count | Frequency (%) |
| . | 43122 | |
| 0 | 42716 | |
| 1 | 31279 | |
| 2 | 23189 | |
| 5 | 21977 | |
| 3 | 21908 | |
| 4 | 20527 | |
| 6 | 19080 | |
| 9 | 16598 | 6.1% |
| 8 | 15972 | 5.9% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 229154 | |
| Other Punctuation | 43122 | 15.8% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 42716 | |
| 1 | 31279 | |
| 2 | 23189 | |
| 5 | 21977 | |
| 3 | 21908 | |
| 4 | 20527 | |
| 6 | 19080 | |
| 9 | 16598 | 7.2% |
| 8 | 15972 | 7.0% |
| 7 | 15908 | 6.9% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 43122 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 272276 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| . | 43122 | |
| 0 | 42716 | |
| 1 | 31279 | |
| 2 | 23189 | |
| 5 | 21977 | |
| 3 | 21908 | |
| 4 | 20527 | |
| 6 | 19080 | |
| 9 | 16598 | 6.1% |
| 8 | 15972 | 5.9% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 272276 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| . | 43122 | |
| 0 | 42716 | |
| 1 | 31279 | |
| 2 | 23189 | |
| 5 | 21977 | |
| 3 | 21908 | |
| 4 | 20527 | |
| 6 | 19080 | |
| 9 | 16598 | 6.1% |
| 8 | 15972 | 5.9% |
Constant  Missing 
| Distinct | 1 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 2361472 |
| Missing (%) | > 99.9% |
| Memory size | 18.0 MiB |
Length
| Max length | 11 |
|---|---|
| Median length | 11 |
| Mean length | 11 |
| Min length | 11 |
Unique
| Unique | 1 |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | Leeward Is. |
|---|
| Value | Count | Frequency (%) |
| leeward | 1 | |
| is | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 2 | |
| L | 1 | |
| w | 1 | |
| a | 1 | |
| r | 1 | |
| d | 1 | |
| (space) | 1 | |
| I | 1 | |
| s | 1 | |
| . | 1 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 7 | |
| Uppercase Letter | 2 | 18.2% |
| Space Separator | 1 | 9.1% |
| Other Punctuation | 1 | 9.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 2 | |
| w | 1 | |
| a | 1 | |
| r | 1 | |
| d | 1 | |
| s | 1 |
Uppercase Letter
| Value | Count | Frequency (%) |
| L | 1 | |
| I | 1 |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 1 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 9 | |
| Common | 2 | 18.2% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 2 | |
| L | 1 | |
| w | 1 | |
| a | 1 | |
| r | 1 | |
| d | 1 | |
| I | 1 | |
| s | 1 |
Common
| Value | Count | Frequency (%) |
| (space) | 1 | |
| . | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 11 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 2 | |
| L | 1 | |
| w | 1 | |
| a | 1 | |
| r | 1 | |
| d | 1 | |
| (space) | 1 | |
| I | 1 | |
| s | 1 | |
| . | 1 |
Missing 
| Distinct | 3 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 2361470 |
| Missing (%) | > 99.9% |
| Memory size | 18.0 MiB |
Length
| Max length | 7 |
|---|---|
| Median length | 7 |
| Mean length | 7 |
| Min length | 7 |
Unique
| Unique | 3 |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | 3190721 |
|---|---|
| 2nd row | Antigua |
| 3rd row | 3869031 |
| Value | Count | Frequency (%) |
| 3190721 | 1 | |
| antigua | 1 | |
| 3869031 | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| 3 | 3 | |
| 1 | 3 | |
| 9 | 2 | 9.5% |
| 0 | 2 | 9.5% |
| 7 | 1 | 4.8% |
| 2 | 1 | 4.8% |
| A | 1 | 4.8% |
| n | 1 | 4.8% |
| t | 1 | 4.8% |
| i | 1 | 4.8% |
| Other values (5) | 5 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 14 | |
| Lowercase Letter | 6 | |
| Uppercase Letter | 1 | 4.8% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 3 | 3 | |
| 1 | 3 | |
| 9 | 2 | |
| 0 | 2 | |
| 7 | 1 | 7.1% |
| 2 | 1 | 7.1% |
| 8 | 1 | 7.1% |
| 6 | 1 | 7.1% |
Lowercase Letter
| Value | Count | Frequency (%) |
| n | 1 | |
| t | 1 | |
| i | 1 | |
| g | 1 | |
| u | 1 | |
| a | 1 |
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 14 | |
| Latin | 7 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 3 | 3 | |
| 1 | 3 | |
| 9 | 2 | |
| 0 | 2 | |
| 7 | 1 | 7.1% |
| 2 | 1 | 7.1% |
| 8 | 1 | 7.1% |
| 6 | 1 | 7.1% |
Latin
| Value | Count | Frequency (%) |
| A | 1 | |
| n | 1 | |
| t | 1 | |
| i | 1 | |
| g | 1 | |
| u | 1 | |
| a | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 21 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 3 | 3 | |
| 1 | 3 | |
| 9 | 2 | 9.5% |
| 0 | 2 | 9.5% |
| 7 | 1 | 4.8% |
| 2 | 1 | 4.8% |
| A | 1 | 4.8% |
| n | 1 | 4.8% |
| t | 1 | 4.8% |
| i | 1 | 4.8% |
| Other values (5) | 5 |
Missing 
| Distinct | 9 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 2103318 |
| Missing (%) | 89.1% |
| Memory size | 18.0 MiB |
Length
| Max length | 23 |
|---|---|
| Median length | 23 |
| Mean length | 22.71790204 |
| Min length | 2 |
Unique
| Unique | 1 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Degrees Minutes Seconds |
|---|---|
| 2nd row | Degrees Minutes Seconds |
| 3rd row | Degrees Minutes Seconds |
| 4th row | Degrees Minutes Seconds |
| 5th row | Degrees Minutes Seconds |
| Value | Count | Frequency (%) |
| degrees | 255841 | |
| minutes | 249742 | |
| seconds | 249742 | |
| decimal | 6099 | 0.8% |
| township | 1828 | 0.2% |
| range | 1828 | 0.2% |
| utm | 195 | < 0.1% |
| marsden | 143 | < 0.1% |
| square | 143 | < 0.1% |
| unknown | 140 | < 0.1% |
| Other values (2) | 8 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 1275220 | |
| s | 757296 | |
| (space) | 507554 | 8.7% |
| n | 503703 | 8.6% |
| g | 257669 | 4.4% |
| i | 257669 | 4.4% |
| r | 256127 | 4.4% |
| d | 255978 | 4.4% |
| D | 255854 | 4.4% |
| c | 255841 | 4.4% |
| Other values (19) | 1281829 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 4597158 | |
| Uppercase Letter | 760028 | 13.0% |
| Space Separator | 507554 | 8.7% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 1275220 | |
| s | 757296 | |
| n | 503703 | 11.0% |
| g | 257669 | 5.6% |
| i | 257669 | 5.6% |
| r | 256127 | 5.6% |
| d | 255978 | 5.6% |
| c | 255841 | 5.6% |
| o | 251710 | 5.5% |
| u | 249885 | 5.4% |
| Other values (9) | 276060 | 6.0% |
Uppercase Letter
| Value | Count | Frequency (%) |
| D | 255854 | |
| M | 250080 | |
| S | 249885 | |
| T | 2023 | 0.3% |
| R | 1828 | 0.2% |
| U | 342 | < 0.1% |
| A | 8 | < 0.1% |
| Q | 7 | < 0.1% |
| G | 1 | < 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 507554 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 5357186 | |
| Common | 507554 | 8.7% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 1275220 | |
| s | 757296 | |
| n | 503703 | 9.4% |
| g | 257669 | 4.8% |
| i | 257669 | 4.8% |
| r | 256127 | 4.8% |
| d | 255978 | 4.8% |
| D | 255854 | 4.8% |
| c | 255841 | 4.8% |
| o | 251710 | 4.7% |
| Other values (18) | 1030119 |
Common
| Value | Count | Frequency (%) |
| (space) | 507554 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 5864740 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 1275220 | |
| s | 757296 | |
| (space) | 507554 | 8.7% |
| n | 503703 | 8.6% |
| g | 257669 | 4.4% |
| i | 257669 | 4.4% |
| r | 256127 | 4.4% |
| d | 255978 | 4.4% |
| D | 255854 | 4.4% |
| c | 255841 | 4.4% |
| Other values (19) | 1281829 |
verbatimSRS
Text
Missing 
| Distinct | 6 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 2361467 |
| Missing (%) | > 99.9% |
| Memory size | 18.0 MiB |
Length
| Max length | 10 |
|---|---|
| Median length | 7 |
| Mean length | 7 |
| Min length | 4 |
Unique
| Unique | 6 |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | 1961-01-09 |
|---|---|
| 2nd row | 2003-06-02 |
| 3rd row | 1955 |
| 4th row | 1911-08-27 |
| 5th row | 1907 |
| Value | Count | Frequency (%) |
| 1961-01-09 | 1 | |
| 2003-06-02 | 1 | |
| 1955 | 1 | |
| 1911-08-27 | 1 | |
| 1907 | 1 | |
| 1876 | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 9 | |
| 0 | 8 | |
| - | 6 | |
| 9 | 5 | |
| 6 | 3 | 7.1% |
| 2 | 3 | 7.1% |
| 7 | 3 | 7.1% |
| 5 | 2 | 4.8% |
| 8 | 2 | 4.8% |
| 3 | 1 | 2.4% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 36 | |
| Dash Punctuation | 6 | 14.3% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 9 | |
| 0 | 8 | |
| 9 | 5 | |
| 6 | 3 | 8.3% |
| 2 | 3 | 8.3% |
| 7 | 3 | 8.3% |
| 5 | 2 | 5.6% |
| 8 | 2 | 5.6% |
| 3 | 1 | 2.8% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 6 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 42 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 9 | |
| 0 | 8 | |
| - | 6 | |
| 9 | 5 | |
| 6 | 3 | 7.1% |
| 2 | 3 | 7.1% |
| 7 | 3 | 7.1% |
| 5 | 2 | 4.8% |
| 8 | 2 | 4.8% |
| 3 | 1 | 2.4% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 42 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 9 | |
| 0 | 8 | |
| - | 6 | |
| 9 | 5 | |
| 6 | 3 | 7.1% |
| 2 | 3 | 7.1% |
| 7 | 3 | 7.1% |
| 5 | 2 | 4.8% |
| 8 | 2 | 4.8% |
| 3 | 1 | 2.4% |
footprintSRS
Text
Missing 
| Distinct | 3 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 2361470 |
| Missing (%) | > 99.9% |
| Memory size | 18.0 MiB |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 2.333333333 |
| Min length | 1 |
Unique
| Unique | 3 |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | 9 |
|---|---|
| 2nd row | 153 |
| 3rd row | 239 |
| Value | Count | Frequency (%) |
| 9 | 1 | |
| 153 | 1 | |
| 239 | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| 9 | 2 | |
| 3 | 2 | |
| 1 | 1 | |
| 5 | 1 | |
| 2 | 1 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 7 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 9 | 2 | |
| 3 | 2 | |
| 1 | 1 | |
| 5 | 1 | |
| 2 | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 7 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 9 | 2 | |
| 3 | 2 | |
| 1 | 1 | |
| 5 | 1 | |
| 2 | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 7 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 9 | 2 | |
| 3 | 2 | |
| 1 | 1 | |
| 5 | 1 | |
| 2 | 1 |
Missing 
| Distinct | 4 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 2361469 |
| Missing (%) | > 99.9% |
| Memory size | 18.0 MiB |
Length
| Max length | 71 |
|---|---|
| Median length | 37 |
| Mean length | 19.5 |
| Min length | 1 |
Unique
| Unique | 4 |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | 9 |
|---|---|
| 2nd row | 153 |
| 3rd row | 239 |
| 4th row | John's Hope, Willings Forest Reserve, trail up from dam through forest. |
| Value | Count | Frequency (%) |
| forest | 2 | |
| 9 | 1 | 7.1% |
| 153 | 1 | 7.1% |
| 239 | 1 | 7.1% |
| john's | 1 | 7.1% |
| hope | 1 | 7.1% |
| willings | 1 | 7.1% |
| reserve | 1 | 7.1% |
| trail | 1 | 7.1% |
| up | 1 | 7.1% |
| Other values (3) | 3 |
Most occurring characters
| Value | Count | Frequency (%) |
| (space) | 10 | 12.8% |
| o | 6 | 7.7% |
| r | 6 | 7.7% |
| e | 6 | 7.7% |
| s | 5 | 6.4% |
| t | 4 | 5.1% |
| i | 3 | 3.8% |
| h | 3 | 3.8% |
| l | 3 | 3.8% |
| 9 | 2 | 2.6% |
| Other values (21) | 30 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 52 | |
| Space Separator | 10 | 12.8% |
| Decimal Number | 7 | 9.0% |
| Uppercase Letter | 5 | 6.4% |
| Other Punctuation | 4 | 5.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| o | 6 | |
| r | 6 | |
| e | 6 | |
| s | 5 | |
| t | 4 | 7.7% |
| i | 3 | 5.8% |
| h | 3 | 5.8% |
| l | 3 | 5.8% |
| m | 2 | 3.8% |
| f | 2 | 3.8% |
| Other values (7) | 12 |
Decimal Number
| Value | Count | Frequency (%) |
| 9 | 2 | |
| 3 | 2 | |
| 1 | 1 | |
| 2 | 1 | |
| 5 | 1 |
Uppercase Letter
| Value | Count | Frequency (%) |
| W | 1 | |
| H | 1 | |
| F | 1 | |
| R | 1 | |
| J | 1 |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 2 | |
| ' | 1 | |
| . | 1 |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 10 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 57 | |
| Common | 21 | 26.9% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| o | 6 | 10.5% |
| r | 6 | 10.5% |
| e | 6 | 10.5% |
| s | 5 | 8.8% |
| t | 4 | 7.0% |
| i | 3 | 5.3% |
| h | 3 | 5.3% |
| l | 3 | 5.3% |
| m | 2 | 3.5% |
| f | 2 | 3.5% |
| Other values (12) | 17 |
Common
| Value | Count | Frequency (%) |
| (space) | 10 | |
| 9 | 2 | 9.5% |
| , | 2 | 9.5% |
| 3 | 2 | 9.5% |
| 1 | 1 | 4.8% |
| ' | 1 | 4.8% |
| 2 | 1 | 4.8% |
| 5 | 1 | 4.8% |
| . | 1 | 4.8% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 78 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| (space) | 10 | 12.8% |
| o | 6 | 7.7% |
| r | 6 | 7.7% |
| e | 6 | 7.7% |
| s | 5 | 6.4% |
| t | 4 | 5.1% |
| i | 3 | 3.8% |
| h | 3 | 3.8% |
| l | 3 | 3.8% |
| 9 | 2 | 2.6% |
| Other values (21) | 30 |
georeferencedBy
Text
Missing 
| Distinct | 9 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 2361464 |
| Missing (%) | > 99.9% |
| Memory size | 18.0 MiB |
Length
| Max length | 29 |
|---|---|
| Median length | 4 |
| Mean length | 8.111111111 |
| Min length | 4 |
Unique
| Unique | 9 |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | Drosera L. |
|---|---|
| 2nd row | 1961 |
| 3rd row | 2003 |
| 4th row | 1955 |
| 5th row | 1911 |
| Value | Count | Frequency (%) |
| drosera | 1 | 7.7% |
| l | 1 | 7.7% |
| 1961 | 1 | 7.7% |
| 2003 | 1 | 7.7% |
| 1955 | 1 | 7.7% |
| 1911 | 1 | 7.7% |
| 1889-03-29 | 1 | 7.7% |
| 1907 | 1 | 7.7% |
| miconia | 1 | 7.7% |
| coronata | 1 | 7.7% |
| Other values (3) | 3 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 9 | 12.3% |
| 9 | 6 | 8.2% |
| o | 5 | 6.8% |
| a | 4 | 5.5% |
| (space) | 4 | 5.5% |
| 0 | 4 | 5.5% |
| r | 3 | 4.1% |
| n | 3 | 4.1% |
| . | 3 | 4.1% |
| 8 | 3 | 4.1% |
| Other values (20) | 29 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 32 | |
| Lowercase Letter | 24 | |
| Uppercase Letter | 6 | 8.2% |
| Space Separator | 4 | 5.5% |
| Other Punctuation | 3 | 4.1% |
| Dash Punctuation | 2 | 2.7% |
| Open Punctuation | 1 | 1.4% |
| Close Punctuation | 1 | 1.4% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| o | 5 | |
| a | 4 | |
| r | 3 | |
| n | 3 | |
| c | 2 | 8.3% |
| i | 2 | 8.3% |
| e | 1 | 4.2% |
| s | 1 | 4.2% |
| t | 1 | 4.2% |
| p | 1 | 4.2% |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 9 | |
| 9 | 6 | |
| 0 | 4 | |
| 8 | 3 | 9.4% |
| 7 | 2 | 6.2% |
| 5 | 2 | 6.2% |
| 3 | 2 | 6.2% |
| 2 | 2 | 6.2% |
| 6 | 2 | 6.2% |
Uppercase Letter
| Value | Count | Frequency (%) |
| D | 2 | |
| M | 1 | |
| L | 1 | |
| B | 1 | |
| C | 1 |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 4 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 3 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 2 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 1 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 43 | |
| Latin | 30 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| o | 5 | |
| a | 4 | |
| r | 3 | |
| n | 3 | |
| D | 2 | 6.7% |
| c | 2 | 6.7% |
| i | 2 | 6.7% |
| M | 1 | 3.3% |
| L | 1 | 3.3% |
| e | 1 | 3.3% |
| Other values (6) | 6 |
Common
| Value | Count | Frequency (%) |
| 1 | 9 | |
| 9 | 6 | |
| (space) | 4 | |
| 0 | 4 | |
| . | 3 | 7.0% |
| 8 | 3 | 7.0% |
| 7 | 2 | 4.7% |
| - | 2 | 4.7% |
| 5 | 2 | 4.7% |
| 3 | 2 | 4.7% |
| Other values (4) | 6 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 73 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 9 | 12.3% |
| 9 | 6 | 8.2% |
| o | 5 | 6.8% |
| a | 4 | 5.5% |
| (space) | 4 | 5.5% |
| 0 | 4 | 5.5% |
| r | 3 | 4.1% |
| n | 3 | 4.1% |
| . | 3 | 4.1% |
| 8 | 3 | 4.1% |
| Other values (20) | 29 |
Missing 
| Distinct | 3 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 2361470 |
| Missing (%) | > 99.9% |
| Memory size | 18.0 MiB |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Unique
| Unique | 3 |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | 1 |
|---|---|
| 2nd row | 6 |
| 3rd row | 8 |
| Value | Count | Frequency (%) |
| 1 | 1 | |
| 6 | 1 | |
| 8 | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 1 | |
| 6 | 1 | |
| 8 | 1 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 3 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 1 | |
| 6 | 1 | |
| 8 | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 3 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 1 | |
| 6 | 1 | |
| 8 | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 1 | |
| 6 | 1 | |
| 8 | 1 |
Missing 
| Distinct | 2394 |
|---|---|
| Distinct (%) | 0.8% |
| Missing | 2055868 |
| Missing (%) | 87.1% |
| Memory size | 18.0 MiB |
Length
| Max length | 302 |
|---|---|
| Median length | 300 |
| Mean length | 25.55988613 |
| Min length | 1 |
Unique
| Unique | 798 |
|---|---|
| Unique (%) | 0.3% |
Sample
| 1st row | unknown, from legacy |
|---|---|
| 2nd row | GEOLocate |
| 3rd row | ArcGIS software with data from New Mexico Resource Geographic Information System Program (http://rgis.unm.edu) and other inhouse resources (historical maps aiding with name changes), MaNIS/HerpNET/ORNIS Georeferencing Guidelines |
| 4th row | Google Earth |
| 5th row | Alexandria Digital Library Gazetteer, MaNIS/HerpNET/ORNIS Georeferencing Guidelines |
| Value | Count | Frequency (%) |
| from | 130830 | 12.9% |
| unknown | 129199 | 12.8% |
| legacy | 128679 | 12.7% |
| | 54944 | 5.4% |
| earth | 40154 | 4.0% |
| geolocate | 36281 | 3.6% |
| georeferencing | 34967 | 3.5% |
| manis/herpnet/ornis | 34312 | 3.4% |
| guidelines | 34310 | 3.4% |
| gazetteer | 20215 | 2.0% |
| Other values (2835) | 367943 |
Most occurring characters
| Value | Count | Frequency (%) |
| (space) | 706229 | 9.0% |
| e | 670307 | 8.6% |
| o | 567440 | 7.3% |
| n | 560593 | 7.2% |
| a | 452507 | 5.8% |
| r | 410585 | 5.3% |
| l | 272912 | 3.5% |
| g | 263573 | 3.4% |
| G | 248223 | 3.2% |
| c | 246755 | 3.2% |
| Other values (69) | 3412105 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 5286976 | |
| Uppercase Letter | 1165753 | 14.9% |
| Space Separator | 706229 | 9.0% |
| Other Punctuation | 338184 | 4.3% |
| Decimal Number | 246713 | 3.2% |
| Open Punctuation | 24664 | 0.3% |
| Close Punctuation | 24612 | 0.3% |
| Dash Punctuation | 17941 | 0.2% |
| Math Symbol | 95 | < 0.1% |
| Connector Punctuation | 62 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 670307 | |
| o | 567440 | 10.7% |
| n | 560593 | 10.6% |
| a | 452507 | 8.6% |
| r | 410585 | 7.8% |
| l | 272912 | 5.2% |
| g | 263573 | 5.0% |
| c | 246755 | 4.7% |
| u | 211476 | 4.0% |
| i | 207930 | 3.9% |
| Other values (17) | 1422898 |
Uppercase Letter
| Value | Count | Frequency (%) |
| G | 248223 | |
| S | 136912 | |
| N | 129186 | |
| E | 114336 | |
| I | 85150 | 7.3% |
| O | 74320 | 6.4% |
| M | 68715 | 5.9% |
| T | 66001 | 5.7% |
| L | 48246 | 4.1% |
| R | 38810 | 3.3% |
| Other values (15) | 155854 |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 201579 | |
| / | 75784 | 22.4% |
| : | 26838 | 7.9% |
| . | 25004 | 7.4% |
| ; | 3908 | 1.2% |
| ! | 2240 | 0.7% |
| # | 1769 | 0.5% |
| ' | 686 | 0.2% |
| & | 350 | 0.1% |
| ? | 21 | < 0.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 112148 | |
| 2 | 38851 | 15.7% |
| 1 | 35160 | 14.3% |
| 4 | 20531 | 8.3% |
| 5 | 10306 | 4.2% |
| 9 | 7460 | 3.0% |
| 7 | 6562 | 2.7% |
| 6 | 6475 | 2.6% |
| 3 | 5706 | 2.3% |
| 8 | 3514 | 1.4% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 706229 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 24664 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 24612 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 17941 |
Math Symbol
| Value | Count | Frequency (%) |
| + | 95 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 62 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 6452729 | |
| Common | 1358500 | 17.4% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 670307 | 10.4% |
| o | 567440 | 8.8% |
| n | 560593 | 8.7% |
| a | 452507 | 7.0% |
| r | 410585 | 6.4% |
| l | 272912 | 4.2% |
| g | 263573 | 4.1% |
| G | 248223 | 3.8% |
| c | 246755 | 3.8% |
| u | 211476 | 3.3% |
| Other values (42) | 2548358 |
Common
| Value | Count | Frequency (%) |
| (space) | 706229 | |
| , | 201579 | 14.8% |
| 0 | 112148 | 8.3% |
| / | 75784 | 5.6% |
| 2 | 38851 | 2.9% |
| 1 | 35160 | 2.6% |
| : | 26838 | 2.0% |
| . | 25004 | 1.8% |
| ( | 24664 | 1.8% |
| ) | 24612 | 1.8% |
| Other values (17) | 87631 | 6.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 7810197 | |
| None | 1032 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| (space) | 706229 | 9.0% |
| e | 670307 | 8.6% |
| o | 567440 | 7.3% |
| n | 560593 | 7.2% |
| a | 452507 | 5.8% |
| r | 410585 | 5.3% |
| l | 272912 | 3.5% |
| g | 263573 | 3.4% |
| G | 248223 | 3.2% |
| c | 246755 | 3.2% |
| Other values (68) | 3411073 |
None
| Value | Count | Frequency (%) |
| í | 1032 |
Missing 
| Distinct | 2 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 2361471 |
| Missing (%) | > 99.9% |
| Memory size | 18.0 MiB |
Length
| Max length | 23 |
|---|---|
| Median length | 12.5 |
| Mean length | 12.5 |
| Min length | 2 |
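The length figures above can be checked by hand against the two values listed in this variable's Sample table. The following is a minimal sketch, assuming length is counted in characters; the variable names are illustrative and are not taken from the profiling tool.

```python
from statistics import mean, median

# The two non-missing values listed in this variable's Sample table.
values = ["88", "1876 00 00 - 0000 00 00"]

# Length of each value in characters (spaces and the dash included).
lengths = [len(v) for v in values]

max_length = max(lengths)
median_length = median(lengths)
mean_length = mean(lengths)
min_length = min(lengths)

print(max_length, median_length, mean_length, min_length)
```

With only two values, the median and the mean coincide, which is why both rows report 12.5.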
Unique
| Unique | 2 |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | 88 |
|---|---|
| 2nd row | 1876 00 00 - 0000 00 00 |
| Value | Count | Frequency (%) |
| 00 | 4 | |
| 88 | 1 | 12.5% |
| 1876 | 1 | 12.5% |
| | 1 | 12.5% |
| 0000 | 1 | 12.5% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 12 | |
| (space) | 6 | |
| 8 | 3 | 12.0% |
| 1 | 1 | 4.0% |
| 7 | 1 | 4.0% |
| 6 | 1 | 4.0% |
| - | 1 | 4.0% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 18 | |
| Space Separator | 6 | 24.0% |
| Dash Punctuation | 1 | 4.0% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 12 | |
| 8 | 3 | 16.7% |
| 1 | 1 | 5.6% |
| 7 | 1 | 5.6% |
| 6 | 1 | 5.6% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 6 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 25 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 12 | |
| (space) | 6 | |
| 8 | 3 | 12.0% |
| 1 | 1 | 4.0% |
| 7 | 1 | 4.0% |
| 6 | 1 | 4.0% |
| - | 1 | 4.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 25 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 12 | |
| (space) | 6 | |
| 8 | 3 | 12.0% |
| 1 | 1 | 4.0% |
| 7 | 1 | 4.0% |
| 6 | 1 | 4.0% |
| - | 1 | 4.0% |
Missing 
| Distinct | 4929 |
|---|---|
| Distinct (%) | 9.5% |
| Missing | 2309427 |
| Missing (%) | 97.8% |
| Memory size | 18.0 MiB |
Length
| Max length | 182 |
|---|---|
| Median length | 126 |
| Mean length | 21.75160435 |
| Min length | 1 |
Unique
| Unique | 2478 |
|---|---|
| Unique (%) | 4.8% |
Sample
| 1st row | Locality extent = 400 m |
|---|---|
| 2nd row | Locality extent = 0.6 |
| 3rd row | Locality extent = 1.059 mi. |
| 4th row | Locality extent = 800 m |
| 5th row | Coordinate Uncertainty In Meters: 44967 |
| Value | Count | Frequency (%) |
| locality | 34471 | |
| | 34367 | |
| extent | 34334 | |
| mi | 10224 | 4.9% |
| ca | 4838 | 2.3% |
| km | 2968 | 1.4% |
| approximate | 2550 | 1.2% |
| in | 2301 | 1.1% |
| coordinate | 2093 | 1.0% |
| meters | 2084 | 1.0% |
| Other values (5049) | 76888 |
Most occurring characters
| Value | Count | Frequency (%) |
| (space) | 155072 | 13.7% |
| t | 128158 | 11.3% |
| e | 96698 | 8.5% |
| a | 61605 | 5.4% |
| i | 59118 | 5.2% |
| o | 54266 | 4.8% |
| n | 53362 | 4.7% |
| l | 42258 | 3.7% |
| c | 39954 | 3.5% |
| . | 39122 | 3.5% |
| Other values (71) | 402471 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 709218 | |
| Space Separator | 155072 | 13.7% |
| Decimal Number | 108626 | 9.6% |
| Uppercase Letter | 78739 | 7.0% |
| Other Punctuation | 45167 | 4.0% |
| Math Symbol | 34336 | 3.0% |
| Dash Punctuation | 538 | < 0.1% |
| Open Punctuation | 193 | < 0.1% |
| Close Punctuation | 193 | < 0.1% |
| Initial Punctuation | 1 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| t | 128158 | |
| e | 96698 | |
| a | 61605 | |
| i | 59118 | |
| o | 54266 | |
| n | 53362 | |
| l | 42258 | 6.0% |
| c | 39954 | 5.6% |
| y | 38215 | 5.4% |
| x | 37841 | 5.3% |
| Other values (16) | 97743 |
Uppercase Letter
| Value | Count | Frequency (%) |
| L | 34663 | |
| C | 8720 | 11.1% |
| A | 5444 | 6.9% |
| M | 2925 | 3.7% |
| I | 2617 | 3.3% |
| G | 2500 | 3.2% |
| P | 2245 | 2.9% |
| U | 2183 | 2.8% |
| D | 2151 | 2.7% |
| S | 2020 | 2.6% |
| Other values (16) | 13271 | 16.9% |
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 20572 | |
| 1 | 18161 | |
| 5 | 14508 | |
| 2 | 13297 | |
| 3 | 10577 | |
| 6 | 8291 | |
| 4 | 6768 | 6.2% |
| 7 | 6465 | 6.0% |
| 8 | 5819 | 5.4% |
| 9 | 4168 | 3.8% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 39122 | |
| : | 2227 | 4.9% |
| , | 1593 | 3.5% |
| ; | 1587 | 3.5% |
| / | 525 | 1.2% |
| ' | 93 | 0.2% |
| " | 9 | < 0.1% |
| & | 6 | < 0.1% |
| # | 5 | < 0.1% |
Math Symbol
| Value | Count | Frequency (%) |
| = | 34320 | |
| + | 16 | < 0.1% |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 189 | |
| [ | 4 | 2.1% |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 189 | |
| ] | 4 | 2.1% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 155072 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 538 |
Initial Punctuation
| Value | Count | Frequency (%) |
| “ | 1 |
Final Punctuation
| Value | Count | Frequency (%) |
| ” | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 787957 | |
| Common | 344127 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| t | 128158 | |
| e | 96698 | |
| a | 61605 | 7.8% |
| i | 59118 | 7.5% |
| o | 54266 | 6.9% |
| n | 53362 | 6.8% |
| l | 42258 | 5.4% |
| c | 39954 | 5.1% |
| y | 38215 | 4.8% |
| x | 37841 | 4.8% |
| Other values (42) | 176482 |
Common
| Value | Count | Frequency (%) |
| (space) | 155072 | |
| . | 39122 | 11.4% |
| = | 34320 | 10.0% |
| 0 | 20572 | 6.0% |
| 1 | 18161 | 5.3% |
| 5 | 14508 | 4.2% |
| 2 | 13297 | 3.9% |
| 3 | 10577 | 3.1% |
| 6 | 8291 | 2.4% |
| 4 | 6768 | 2.0% |
| Other values (19) | 23439 | 6.8% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1132082 | |
| Punctuation | 2 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| (space) | 155072 | 13.7% |
| t | 128158 | 11.3% |
| e | 96698 | 8.5% |
| a | 61605 | 5.4% |
| i | 59118 | 5.2% |
| o | 54266 | 4.8% |
| n | 53362 | 4.7% |
| l | 42258 | 3.7% |
| c | 39954 | 3.5% |
| . | 39122 | 3.5% |
| Other values (69) | 402469 |
Punctuation
| Value | Count | Frequency (%) |
| “ | 1 | |
| ” | 1 |
geologicalContextID
Text
Constant  Missing 
| Distinct | 1 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 2361472 |
| Missing (%) | > 99.9% |
| Memory size | 18.0 MiB |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Unique
| Unique | 1 |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | 3 |
|---|
| Value | Count | Frequency (%) |
| 3 | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| 3 | 1 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 1 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 3 | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 1 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 3 | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 3 | 1 |
earliestEonOrLowestEonothem
Text
Constant  Missing 
| Distinct | 1 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 2361472 |
| Missing (%) | > 99.9% |
| Memory size | 18.0 MiB |
Length
| Max length | 2 |
|---|---|
| Median length | 2 |
| Mean length | 2 |
| Min length | 2 |
Unique
| Unique | 1 |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | 29 |
|---|
| Value | Count | Frequency (%) |
| 29 | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| 2 | 1 | |
| 9 | 1 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 2 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 1 | |
| 9 | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 2 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 2 | 1 | |
| 9 | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 2 | 1 | |
| 9 | 1 |
latestEonOrHighestEonothem
Text
Missing 
| Distinct | 3 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 2361470 |
| Missing (%) | > 99.9% |
| Memory size | 18.0 MiB |
Length
| Max length | 67 |
|---|---|
| Median length | 51 |
| Mean length | 43 |
| Min length | 11 |
Unique
| Unique | 3 |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | Plantae, Dicotyledonae, Caryophyllales, Droseraceae |
|---|---|
| 2nd row | 29 Mar 1889 |
| 3rd row | Plantae, Dicotyledonae, Myrtales, Melastomataceae, Melastomatoideae |
| Value | Count | Frequency (%) |
| plantae | 2 | |
| dicotyledonae | 2 | |
| caryophyllales | 1 | |
| droseraceae | 1 | |
| 29 | 1 | |
| mar | 1 | |
| 1889 | 1 | |
| myrtales | 1 | |
| melastomataceae | 1 | |
| melastomatoideae | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 19 | |
| e | 17 | |
| l | 10 | 7.8% |
| t | 9 | 7.0% |
| (space) | 9 | 7.0% |
| o | 9 | 7.0% |
| , | 7 | 5.4% |
| y | 5 | 3.9% |
| r | 5 | 3.9% |
| s | 5 | 3.9% |
| Other values (15) | 34 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 97 | |
| Uppercase Letter | 10 | 7.8% |
| Space Separator | 9 | 7.0% |
| Other Punctuation | 7 | 5.4% |
| Decimal Number | 6 | 4.7% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 19 | |
| e | 17 | |
| l | 10 | |
| t | 9 | |
| o | 9 | |
| y | 5 | 5.2% |
| r | 5 | 5.2% |
| s | 5 | 5.2% |
| n | 4 | 4.1% |
| c | 4 | 4.1% |
| Other values (5) | 10 |
Uppercase Letter
| Value | Count | Frequency (%) |
| M | 4 | |
| D | 3 | |
| P | 2 | |
| C | 1 | 10.0% |
Decimal Number
| Value | Count | Frequency (%) |
| 8 | 2 | |
| 9 | 2 | |
| 2 | 1 | |
| 1 | 1 |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 9 |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 7 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 107 | |
| Common | 22 | 17.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 19 | |
| e | 17 | |
| l | 10 | |
| t | 9 | |
| o | 9 | |
| y | 5 | 4.7% |
| r | 5 | 4.7% |
| s | 5 | 4.7% |
| n | 4 | 3.7% |
| M | 4 | 3.7% |
| Other values (9) | 20 |
Common
| Value | Count | Frequency (%) |
| (space) | 9 | |
| , | 7 | |
| 8 | 2 | 9.1% |
| 9 | 2 | 9.1% |
| 2 | 1 | 4.5% |
| 1 | 1 | 4.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 129 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 19 | |
| e | 17 | |
| l | 10 | 7.8% |
| t | 9 | 7.0% |
| (space) | 9 | 7.0% |
| o | 9 | 7.0% |
| , | 7 | 5.4% |
| y | 5 | 3.9% |
| r | 5 | 3.9% |
| s | 5 | 3.9% |
| Other values (15) | 34 |
earliestEraOrLowestErathem
Text
Constant  Missing 
| Distinct | 1 |
|---|---|
| Distinct (%) | 50.0% |
| Missing | 2361471 |
| Missing (%) | > 99.9% |
| Memory size | 18.0 MiB |
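The percentages in this table are consistent with Distinct (%) being taken over the non-missing values only, and Missing (%) over all observations. A short sketch of that arithmetic follows, using the observation count from the Dataset statistics table and the two values in this variable's Sample table. This is an assumption about how the figures relate, not a statement of the profiling tool's implementation, and the variable names are illustrative.

```python
from collections import Counter

# Total rows (Dataset statistics) and this variable's reported Missing count.
n_observations = 2_361_473
n_missing = 2_361_471

# The non-missing values shown in this variable's Sample table.
present = ["Plantae", "Plantae"]

value_counts = Counter(present)
n_distinct = len(value_counts)                                 # different values
n_unique = sum(1 for c in value_counts.values() if c == 1)     # values seen exactly once

distinct_pct = 100 * n_distinct / len(present)
unique_pct = 100 * n_unique / len(present)
missing_pct = 100 * n_missing / n_observations

print(f"Distinct: {n_distinct} ({distinct_pct:.1f}%)")
print(f"Unique: {n_unique} ({unique_pct:.1f}%)")
print(f"Missing: {n_missing} ({missing_pct:.5f}%)")
```

Because the same value occurs twice, the column has one distinct value but no unique values, which matches the Constant alert alongside a Distinct (%) of 50.0%.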
Length
| Max length | 7 |
|---|---|
| Median length | 7 |
| Mean length | 7 |
| Min length | 7 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Plantae |
|---|---|
| 2nd row | Plantae |
| Value | Count | Frequency (%) |
| plantae | 2 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 4 | |
| P | 2 | |
| l | 2 | |
| n | 2 | |
| t | 2 | |
| e | 2 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 12 | |
| Uppercase Letter | 2 | 14.3% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 4 | |
| l | 2 | |
| n | 2 | |
| t | 2 | |
| e | 2 |
Uppercase Letter
| Value | Count | Frequency (%) |
| P | 2 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 14 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 4 | |
| P | 2 | |
| l | 2 | |
| n | 2 | |
| t | 2 | |
| e | 2 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 14 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 4 | |
| P | 2 | |
| l | 2 | |
| n | 2 | |
| t | 2 | |
| e | 2 |
latestEraOrHighestErathem
Text
Constant  Missing 
| Distinct | 1 |
|---|---|
| Distinct (%) | 50.0% |
| Missing | 2361471 |
| Missing (%) | > 99.9% |
| Memory size | 18.0 MiB |
Length
| Max length | 12 |
|---|---|
| Median length | 12 |
| Mean length | 12 |
| Min length | 12 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Tracheophyta |
|---|---|
| 2nd row | Tracheophyta |
| Value | Count | Frequency (%) |
| tracheophyta | 2 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 4 | |
| h | 4 | |
| T | 2 | |
| r | 2 | |
| c | 2 | |
| e | 2 | |
| o | 2 | |
| p | 2 | |
| y | 2 | |
| t | 2 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 22 | |
| Uppercase Letter | 2 | 8.3% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 4 | |
| h | 4 | |
| r | 2 | |
| c | 2 | |
| e | 2 | |
| o | 2 | |
| p | 2 | |
| y | 2 | |
| t | 2 |
Uppercase Letter
| Value | Count | Frequency (%) |
| T | 2 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 24 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 4 | |
| h | 4 | |
| T | 2 | |
| r | 2 | |
| c | 2 | |
| e | 2 | |
| o | 2 | |
| p | 2 | |
| y | 2 | |
| t | 2 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 24 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 4 | |
| h | 4 | |
| T | 2 | |
| r | 2 | |
| c | 2 | |
| e | 2 | |
| o | 2 | |
| p | 2 | |
| y | 2 | |
| t | 2 |
earliestPeriodOrLowestSystem
Text
Constant  Missing 
| Distinct | 1 |
|---|---|
| Distinct (%) | 50.0% |
| Missing | 2361471 |
| Missing (%) | > 99.9% |
| Memory size | 18.0 MiB |
Length
| Max length | 13 |
|---|---|
| Median length | 13 |
| Mean length | 13 |
| Min length | 13 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Magnoliopsida |
|---|---|
| 2nd row | Magnoliopsida |
| Value | Count | Frequency (%) |
| magnoliopsida | 2 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 4 | |
| o | 4 | |
| i | 4 | |
| M | 2 | |
| g | 2 | |
| n | 2 | |
| l | 2 | |
| p | 2 | |
| s | 2 | |
| d | 2 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 24 | |
| Uppercase Letter | 2 | 7.7% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 4 | |
| o | 4 | |
| i | 4 | |
| g | 2 | |
| n | 2 | |
| l | 2 | |
| p | 2 | |
| s | 2 | |
| d | 2 |
Uppercase Letter
| Value | Count | Frequency (%) |
| M | 2 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 26 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 4 | |
| o | 4 | |
| i | 4 | |
| M | 2 | |
| g | 2 | |
| n | 2 | |
| l | 2 | |
| p | 2 | |
| s | 2 | |
| d | 2 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 26 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 4 | |
| o | 4 | |
| i | 4 | |
| M | 2 | |
| g | 2 | |
| n | 2 | |
| l | 2 | |
| p | 2 | |
| s | 2 | |
| d | 2 |
latestPeriodOrHighestSystem
Text
Missing 
| Distinct | 2 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 2361471 |
| Missing (%) | > 99.9% |
| Memory size | 18.0 MiB |
Length
| Max length | 14 |
|---|---|
| Median length | 11 |
| Mean length | 11 |
| Min length | 8 |
Unique
| Unique | 2 |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | Caryophyllales |
|---|---|
| 2nd row | Myrtales |
| Value | Count | Frequency (%) |
| caryophyllales | 1 | |
| myrtales | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| l | 4 | |
| a | 3 | |
| y | 3 | |
| r | 2 | |
| e | 2 | |
| s | 2 | |
| C | 1 | 4.5% |
| o | 1 | 4.5% |
| p | 1 | 4.5% |
| h | 1 | 4.5% |
| Other values (2) | 2 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 20 | |
| Uppercase Letter | 2 | 9.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| l | 4 | |
| a | 3 | |
| y | 3 | |
| r | 2 | |
| e | 2 | |
| s | 2 | |
| o | 1 | 5.0% |
| p | 1 | 5.0% |
| h | 1 | 5.0% |
| t | 1 | 5.0% |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 1 | |
| M | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 22 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| l | 4 | |
| a | 3 | |
| y | 3 | |
| r | 2 | |
| e | 2 | |
| s | 2 | |
| C | 1 | 4.5% |
| o | 1 | 4.5% |
| p | 1 | 4.5% |
| h | 1 | 4.5% |
| Other values (2) | 2 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 22 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| l | 4 | |
| a | 3 | |
| y | 3 | |
| r | 2 | |
| e | 2 | |
| s | 2 | |
| C | 1 | 4.5% |
| o | 1 | 4.5% |
| p | 1 | 4.5% |
| h | 1 | 4.5% |
| Other values (2) | 2 |
earliestEpochOrLowestSeries
Text
Constant  Missing 
| Distinct | 1 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 2361472 |
| Missing (%) | > 99.9% |
| Memory size | 18.0 MiB |
Length
| Max length | 7 |
|---|---|
| Median length | 7 |
| Mean length | 7 |
| Min length | 7 |
Unique
| Unique | 1 |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | 5410907 |
|---|
| Value | Count | Frequency (%) |
| 5410907 | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 2 | |
| 5 | 1 | |
| 4 | 1 | |
| 1 | 1 | |
| 9 | 1 | |
| 7 | 1 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 7 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 2 | |
| 5 | 1 | |
| 4 | 1 | |
| 1 | 1 | |
| 9 | 1 | |
| 7 | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 7 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 2 | |
| 5 | 1 | |
| 4 | 1 | |
| 1 | 1 | |
| 9 | 1 | |
| 7 | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 7 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 2 | |
| 5 | 1 | |
| 4 | 1 | |
| 1 | 1 | |
| 9 | 1 | |
| 7 | 1 |
latestEpochOrHighestSeries
Text
Missing 
| Distinct | 8 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 2361465 |
| Missing (%) | > 99.9% |
| Memory size | 18.0 MiB |
Length
| Max length | 69 |
|---|---|
| Median length | 39.5 |
| Mean length | 35.875 |
| Min length | 11 |
Unique
| Unique | 8 |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | Droseraceae |
|---|---|
| 2nd row | Asia, Taiwan |
| 3rd row | North America, United States, Oklahoma, Pontotoc County |
| 4th row | North America, United States, Alaska |
| 5th row | North America, United States, Massachusetts |
| Value | Count | Frequency (%) |
| north | 5 | |
| united | 5 | |
| states | 5 | |
| america | 4 | |
| county | 2 | 5.7% |
| massachusetts | 2 | 5.7% |
| ocean | 1 | 2.9% |
| atlantic | 1 | 2.9% |
| melastomataceae | 1 | 2.9% |
| cochise | 1 | 2.9% |
| Other values (8) | 8 |
Most occurring characters
| Value | Count | Frequency (%) |
| t | 33 | 11.5% |
| a | 31 | 10.8% |
| (space) | 27 | 9.4% |
| e | 25 | 8.7% |
| s | 19 | 6.6% |
| o | 15 | 5.2% |
| i | 14 | 4.9% |
| , | 14 | 4.9% |
| n | 13 | 4.5% |
| r | 13 | 4.5% |
| Other values (22) | 83 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 211 | |
| Uppercase Letter | 35 | 12.2% |
| Space Separator | 27 | 9.4% |
| Other Punctuation | 14 | 4.9% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| t | 33 | |
| a | 31 | |
| e | 25 | |
| s | 19 | |
| o | 15 | |
| i | 14 | |
| n | 13 | 6.2% |
| r | 13 | 6.2% |
| c | 12 | 5.7% |
| h | 9 | 4.3% |
| Other values (9) | 27 |
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 8 | |
| N | 5 | |
| U | 5 | |
| S | 5 | |
| M | 3 | 8.6% |
| C | 3 | 8.6% |
| O | 2 | 5.7% |
| B | 1 | 2.9% |
| D | 1 | 2.9% |
| P | 1 | 2.9% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 27 |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 14 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 246 | |
| Common | 41 | 14.3% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| t | 33 | |
| a | 31 | |
| e | 25 | |
| s | 19 | 7.7% |
| o | 15 | 6.1% |
| i | 14 | 5.7% |
| n | 13 | 5.3% |
| r | 13 | 5.3% |
| c | 12 | 4.9% |
| h | 9 | 3.7% |
| Other values (20) | 62 |
Common
| Value | Count | Frequency (%) |
| (space) | 27 | |
| , | 14 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 287 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| t | 33 | 11.5% |
| a | 31 | 10.8% |
| (space) | 27 | 9.4% |
| e | 25 | 8.7% |
| s | 19 | 6.6% |
| o | 15 | 5.2% |
| i | 14 | 4.9% |
| , | 14 | 4.9% |
| n | 13 | 4.5% |
| r | 13 | 4.5% |
| Other values (22) | 83 |
Missing 
| Distinct | 2 |
|---|---|
| Distinct (%) | 40.0% |
| Missing | 2361468 |
| Missing (%) | > 99.9% |
| Memory size | 18.0 MiB |
Length
| Max length | 13 |
|---|---|
| Median length | 13 |
| Mean length | 11.2 |
| Min length | 4 |
Unique
| Unique | 1 |
|---|---|
| Unique (%) | 20.0% |
Sample
| 1st row | ASIA |
|---|---|
| 2nd row | NORTH_AMERICA |
| 3rd row | NORTH_AMERICA |
| 4th row | NORTH_AMERICA |
| 5th row | NORTH_AMERICA |
| Value | Count | Frequency (%) |
| north_america | 4 | |
| asia | 1 | 20.0% |
Most occurring characters
| Value | Count | Frequency (%) |
| A | 10 | |
| R | 8 | |
| I | 5 | |
| N | 4 | 7.1% |
| O | 4 | 7.1% |
| T | 4 | 7.1% |
| H | 4 | 7.1% |
| _ | 4 | 7.1% |
| M | 4 | 7.1% |
| E | 4 | 7.1% |
| Other values (2) | 5 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 52 | |
| Connector Punctuation | 4 | 7.1% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 10 | |
| R | 8 | |
| I | 5 | |
| N | 4 | 7.7% |
| O | 4 | 7.7% |
| T | 4 | 7.7% |
| H | 4 | 7.7% |
| M | 4 | 7.7% |
| E | 4 | 7.7% |
| C | 4 | 7.7% |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 4 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 52 | |
| Common | 4 | 7.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| A | 10 | |
| R | 8 | |
| I | 5 | |
| N | 4 | 7.7% |
| O | 4 | 7.7% |
| T | 4 | 7.7% |
| H | 4 | 7.7% |
| M | 4 | 7.7% |
| E | 4 | 7.7% |
| C | 4 | 7.7% |
Common
| Value | Count | Frequency (%) |
| _ | 4 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 56 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| A | 10 | |
| R | 8 | |
| I | 5 | |
| N | 4 | 7.1% |
| O | 4 | 7.1% |
| T | 4 | 7.1% |
| H | 4 | 7.1% |
| _ | 4 | 7.1% |
| M | 4 | 7.1% |
| E | 4 | 7.1% |
| Other values (2) | 5 |
latestAgeOrHighestStage
Text
Constant  Missing 
| Distinct | 1 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 2361472 |
| Missing (%) | > 99.9% |
| Memory size | 18.0 MiB |
Length
| Max length | 20 |
|---|---|
| Median length | 20 |
| Mean length | 20 |
| Min length | 20 |
Unique
| Unique | 1 |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | North Atlantic Ocean |
|---|
| Value | Count | Frequency (%) |
| north | 1 | |
| atlantic | 1 | |
| ocean | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| t | 3 | |
| (space) | 2 | |
| a | 2 | |
| n | 2 | |
| c | 2 | |
| N | 1 | 5.0% |
| o | 1 | 5.0% |
| r | 1 | 5.0% |
| h | 1 | 5.0% |
| A | 1 | 5.0% |
| Other values (4) | 4 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 15 | |
| Uppercase Letter | 3 | 15.0% |
| Space Separator | 2 | 10.0% |
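The category counts above can be reproduced for the single sample value with Python's standard `unicodedata` module, which returns the Unicode general category of each character. This is a sketch only: the report does not say which library produced its figures, and the `CATEGORY_NAMES` mapping is an illustrative assumption that covers just the categories present in this value.

```python
import unicodedata
from collections import Counter

# The only non-missing value of this variable, from its Sample table.
value = "North Atlantic Ocean"

# Illustrative mapping from general-category codes to the labels used in the report.
CATEGORY_NAMES = {
    "Ll": "Lowercase Letter",
    "Lu": "Uppercase Letter",
    "Zs": "Space Separator",
}

# Count characters per category label.
counts = Counter(CATEGORY_NAMES[unicodedata.category(ch)] for ch in value)
total = sum(counts.values())

for name, count in counts.most_common():
    print(f"{name}: {count} ({100 * count / total:.1f}%)")
```

The percentages are each category's share of all 20 characters in the value, which agrees with the 15.0% and 10.0% shown for uppercase letters and spaces.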
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| t | 3 | |
| a | 2 | |
| n | 2 | |
| c | 2 | |
| o | 1 | 6.7% |
| r | 1 | 6.7% |
| h | 1 | 6.7% |
| l | 1 | 6.7% |
| i | 1 | 6.7% |
| e | 1 | 6.7% |
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 1 | |
| A | 1 | |
| O | 1 |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 2 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 18 | |
| Common | 2 | 10.0% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| t | 3 | |
| a | 2 | |
| n | 2 | |
| c | 2 | |
| N | 1 | 5.6% |
| o | 1 | 5.6% |
| r | 1 | 5.6% |
| h | 1 | 5.6% |
| A | 1 | 5.6% |
| l | 1 | 5.6% |
| Other values (3) | 3 |
Common
| Value | Count | Frequency (%) |
| (space) | 2 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 20 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| t | 3 | |
| (space) | 2 | |
| a | 2 | |
| n | 2 | |
| c | 2 | |
| N | 1 | 5.0% |
| o | 1 | 5.0% |
| r | 1 | 5.0% |
| h | 1 | 5.0% |
| A | 1 | 5.0% |
| Other values (4) | 4 |
lowestBiostratigraphicZone
Text
Constant  Missing 
| Distinct | 1 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 2361472 |
| Missing (%) | > 99.9% |
| Memory size | 18.0 MiB |
Length
| Max length | 10 |
|---|---|
| Median length | 10 |
| Mean length | 10 |
| Min length | 10 |
Unique
| Unique | 1 |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | Scharf, U. |
|---|
| Value | Count | Frequency (%) |
| scharf | 1 | |
| u | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| S | 1 | |
| c | 1 | |
| h | 1 | |
| a | 1 | |
| r | 1 | |
| f | 1 | |
| , | 1 | |
| (space) | 1 | |
| U | 1 | |
| . | 1 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 5 | |
| Uppercase Letter | 2 | 20.0% |
| Other Punctuation | 2 | 20.0% |
| Space Separator | 1 | 10.0% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| c | 1 | |
| h | 1 | |
| a | 1 | |
| r | 1 | |
| f | 1 |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 1 | |
| U | 1 |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 1 | |
| . | 1 |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 7 | |
| Common | 3 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| S | 1 | |
| c | 1 | |
| h | 1 | |
| a | 1 | |
| r | 1 | |
| f | 1 | |
| U | 1 |
Common
| Value | Count | Frequency (%) |
| , | 1 | |
| (space) | 1 | |
| . | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 10 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| S | 1 | |
| c | 1 | |
| h | 1 | |
| a | 1 | |
| r | 1 | |
| f | 1 | |
| , | 1 | |
| (space) | 1 | |
| U | 1 | |
| . | 1 |
highestBiostratigraphicZone
Text
Missing 
| Distinct | 3 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 2361470 |
| Missing (%) | > 99.9% |
| Memory size | 18.0 MiB |
Length
| Max length | 58 |
|---|---|
| Median length | 7 |
| Mean length | 24 |
| Min length | 7 |
Unique
| Unique | 3 |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | Drosera |
|---|---|
| 2nd row | North America, Mexico, Baja California Norte, Guadalupe I. |
| 3rd row | Miconia |
| Value | Count | Frequency (%) |
| drosera | 1 | |
| north | 1 | |
| america | 1 | |
| mexico | 1 | |
| baja | 1 | |
| california | 1 | |
| norte | 1 | |
| guadalupe | 1 | |
| i | 1 | |
| miconia | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 9 | 12.5% |
| (space) | 7 | 9.7% |
| i | 6 | 8.3% |
| o | 6 | 8.3% |
| r | 6 | 8.3% |
| e | 5 | 6.9% |
| c | 3 | 4.2% |
| , | 3 | 4.2% |
| l | 2 | 2.8% |
| u | 2 | 2.8% |
| Other values (19) | 23 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 51 | |
| Uppercase Letter | 10 | 13.9% |
| Space Separator | 7 | 9.7% |
| Other Punctuation | 4 | 5.6% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 9 | |
| i | 6 | |
| o | 6 | |
| r | 6 | |
| e | 5 | |
| c | 3 | 5.9% |
| l | 2 | 3.9% |
| u | 2 | 3.9% |
| t | 2 | 3.9% |
| n | 2 | 3.9% |
| Other values (8) | 8 |
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 2 | |
| M | 2 | |
| I | 1 | |
| G | 1 | |
| D | 1 | |
| C | 1 | |
| B | 1 | |
| A | 1 |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 3 | |
| . | 1 | 25.0% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 7 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 61 | |
| Common | 11 | 15.3% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 9 | |
| i | 6 | 9.8% |
| o | 6 | 9.8% |
| r | 6 | 9.8% |
| e | 5 | 8.2% |
| c | 3 | 4.9% |
| l | 2 | 3.3% |
| u | 2 | 3.3% |
| t | 2 | 3.3% |
| N | 2 | 3.3% |
| Other values (16) | 18 |
Common
| Value | Count | Frequency (%) |
| (space) | 7 | |
| , | 3 | |
| . | 1 | 9.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 72 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 9 | 12.5% |
| (space) | 7 | 9.7% |
| i | 6 | 8.3% |
| o | 6 | 8.3% |
| r | 6 | 8.3% |
| e | 5 | 6.9% |
| c | 3 | 4.2% |
| , | 3 | 4.2% |
| l | 2 | 2.8% |
| u | 2 | 2.8% |
| Other values (19) | 23 |
Missing 
| Distinct | 6 |
|---|---|
| Distinct (%) | 60.0% |
| Missing | 2361463 |
| Missing (%) | > 99.9% |
| Memory size | 18.0 MiB |
Length
| Max length | 25 |
|---|---|
| Median length | 2 |
| Mean length | 6.4 |
| Min length | 2 |
Unique
| Unique | 5 |
|---|---|
| Unique (%) | 50.0% |
Sample
| 1st row | Drosera |
|---|---|
| 2nd row | TW |
| 3rd row | US |
| 4th row | Campanula rotundifolia L. |
| 5th row | US |
| Value | Count | Frequency (%) |
| us | 5 | |
| drosera | 1 | 8.3% |
| tw | 1 | 8.3% |
| campanula | 1 | 8.3% |
| rotundifolia | 1 | 8.3% |
| l | 1 | 8.3% |
| north_america | 1 | 8.3% |
| miconia | 1 | 8.3% |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 6 | 9.4% |
| U | 5 | 7.8% |
| S | 5 | 7.8% |
| o | 4 | 6.2% |
| i | 4 | 6.2% |
| n | 3 | 4.7% |
| r | 3 | 4.7% |
| M | 2 | 3.1% |
| A | 2 | 3.1% |
| R | 2 | 3.1% |
| Other values (23) | 28 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 32 | |
| Uppercase Letter | 28 | |
| Space Separator | 2 | 3.1% |
| Connector Punctuation | 1 | 1.6% |
| Other Punctuation | 1 | 1.6% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 6 | |
| o | 4 | |
| i | 4 | |
| n | 3 | |
| r | 3 | |
| l | 2 | 6.2% |
| u | 2 | 6.2% |
| p | 1 | 3.1% |
| s | 1 | 3.1% |
| e | 1 | 3.1% |
| Other values (5) | 5 |
Uppercase Letter
| Value | Count | Frequency (%) |
| U | 5 | |
| S | 5 | |
| M | 2 | 7.1% |
| A | 2 | 7.1% |
| R | 2 | 7.1% |
| C | 2 | 7.1% |
| T | 2 | 7.1% |
| O | 1 | 3.6% |
| I | 1 | 3.6% |
| E | 1 | 3.6% |
| Other values (5) | 5 |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 2 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 1 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 60 | |
| Common | 4 | 6.2% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 6 | 10.0% |
| U | 5 | 8.3% |
| S | 5 | 8.3% |
| o | 4 | 6.7% |
| i | 4 | 6.7% |
| n | 3 | 5.0% |
| r | 3 | 5.0% |
| M | 2 | 3.3% |
| A | 2 | 3.3% |
| R | 2 | 3.3% |
| Other values (20) | 24 |
Common
| Value | Count | Frequency (%) |
| (space) | 2 | |
| _ | 1 | |
| . | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 64 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 6 | 9.4% |
| U | 5 | 7.8% |
| S | 5 | 7.8% |
| o | 4 | 6.2% |
| i | 4 | 6.2% |
| n | 3 | 4.7% |
| r | 3 | 4.7% |
| M | 2 | 3.1% |
| A | 2 | 3.1% |
| R | 2 | 3.1% |
| Other values (23) | 28 |
group
Text
Missing 
| Distinct | 4 |
|---|---|
| Distinct (%) | 80.0% |
| Missing | 2361468 |
| Missing (%) | > 99.9% |
| Memory size | 18.0 MiB |
Length
| Max length | 13 |
|---|---|
| Median length | 8 |
| Mean length | 9.4 |
| Min length | 6 |
Unique
| Unique | 3 |
|---|---|
| Unique (%) | 60.0% |
Sample
| 1st row | Oklahoma |
|---|---|
| 2nd row | Alaska |
| 3rd row | Massachusetts |
| 4th row | Arizona |
| 5th row | Massachusetts |
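Only five records are non-missing for this variable, so the five sample rows are the complete set of values, and the Distinct, Unique, and Length figures above can be checked directly. The sketch below uses only the standard library; the `summarize` helper is a hypothetical name, and it assumes "Distinct" counts different values while "Unique" counts values that occur exactly once, which is consistent with the figures shown.

```python
from collections import Counter
from statistics import mean, median

# The five non-missing values shown in the Sample table.
values = ["Oklahoma", "Alaska", "Massachusetts", "Arizona", "Massachusetts"]

def summarize(values):
    """Recompute distinct/unique counts and length statistics for a column."""
    freq = Counter(values)
    n = len(values)
    distinct = len(freq)                               # different values
    unique = sum(1 for c in freq.values() if c == 1)   # values seen once
    lengths = [len(v) for v in values]
    return {
        "distinct": distinct,
        "distinct_pct": 100 * distinct / n,
        "unique": unique,
        "unique_pct": 100 * unique / n,
        "max_length": max(lengths),
        "median_length": median(lengths),
        "mean_length": mean(lengths),
        "min_length": min(lengths),
    }

summary = summarize(values)
for key, value in summary.items():
    print(f"{key}: {value}")
```

"Massachusetts" appears twice, so there are 4 distinct values (80.0%) but only 3 that occur once (60.0%).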
| Value | Count | Frequency (%) |
| massachusetts | 2 | |
| oklahoma | 1 | |
| alaska | 1 | |
| arizona | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 9 | |
| s | 9 | |
| t | 4 | 8.5% |
| h | 3 | 6.4% |
| M | 2 | 4.3% |
| A | 2 | 4.3% |
| o | 2 | 4.3% |
| l | 2 | 4.3% |
| k | 2 | 4.3% |
| e | 2 | 4.3% |
| Other values (8) | 10 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 42 | |
| Uppercase Letter | 5 | 10.6% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 9 | |
| s | 9 | |
| t | 4 | |
| h | 3 | 7.1% |
| o | 2 | 4.8% |
| l | 2 | 4.8% |
| k | 2 | 4.8% |
| e | 2 | 4.8% |
| u | 2 | 4.8% |
| c | 2 | 4.8% |
| Other values (5) | 5 |
Uppercase Letter
| Value | Count | Frequency (%) |
| M | 2 | |
| A | 2 | |
| O | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 47 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 9 | |
| s | 9 | |
| t | 4 | 8.5% |
| h | 3 | 6.4% |
| M | 2 | 4.3% |
| A | 2 | 4.3% |
| o | 2 | 4.3% |
| l | 2 | 4.3% |
| k | 2 | 4.3% |
| e | 2 | 4.3% |
| Other values (8) | 10 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 47 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 9 | |
| s | 9 | |
| t | 4 | 8.5% |
| h | 3 | 6.4% |
| M | 2 | 4.3% |
| A | 2 | 4.3% |
| o | 2 | 4.3% |
| l | 2 | 4.3% |
| k | 2 | 4.3% |
| e | 2 | 4.3% |
| Other values (8) | 10 |
formation
Text
Missing 
| Distinct | 3 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 2361470 |
| Missing (%) | > 99.9% |
| Memory size | 18.0 MiB |
Length
| Max length | 17 |
|---|---|
| Median length | 15 |
| Mean length | 13 |
| Min length | 7 |
Unique
| Unique | 3 |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | Pontotoc County |
|---|---|
| 2nd row | Cochise |
| 3rd row | Barnstable County |
| Value | Count | Frequency (%) |
| county | 2 | |
| pontotoc | 1 | |
| cochise | 1 | |
| barnstable | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| o | 6 | |
| t | 5 | |
| n | 4 | |
| C | 3 | 7.7% |
| a | 2 | 5.1% |
| c | 2 | 5.1% |
| (space) | 2 | 5.1% |
| u | 2 | 5.1% |
| y | 2 | 5.1% |
| s | 2 | 5.1% |
| Other values (8) | 9 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 32 | |
| Uppercase Letter | 5 | 12.8% |
| Space Separator | 2 | 5.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| o | 6 | |
| t | 5 | |
| n | 4 | |
| a | 2 | 6.2% |
| c | 2 | 6.2% |
| u | 2 | 6.2% |
| y | 2 | 6.2% |
| s | 2 | 6.2% |
| e | 2 | 6.2% |
| b | 1 | 3.1% |
| Other values (4) | 4 |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 3 | |
| P | 1 | 20.0% |
| B | 1 | 20.0% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 2 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 37 | |
| Common | 2 | 5.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| o | 6 | |
| t | 5 | |
| n | 4 | |
| C | 3 | |
| a | 2 | 5.4% |
| c | 2 | 5.4% |
| u | 2 | 5.4% |
| y | 2 | 5.4% |
| s | 2 | 5.4% |
| e | 2 | 5.4% |
| Other values (7) | 7 |
Common
| Value | Count | Frequency (%) |
| (space) | 2 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 39 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| o | 6 | |
| t | 5 | |
| n | 4 | |
| C | 3 | 7.7% |
| a | 2 | 5.1% |
| c | 2 | 5.1% |
| (space) | 2 | 5.1% |
| u | 2 | 5.1% |
| y | 2 | 5.1% |
| s | 2 | 5.1% |
| Other values (8) | 9 |
member
Text
Missing 
| Distinct | 2 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 2361471 |
| Missing (%) | > 99.9% |
| Memory size | 18.0 MiB |
Length
| Max length | 12 |
|---|---|
| Median length | 10 |
| Mean length | 10 |
| Min length | 8 |
Unique
| Unique | 2 |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | Guadalupe I. |
|---|---|
| 2nd row | coronata |
| Value | Count | Frequency (%) |
| guadalupe | 1 | |
| i | 1 | |
| coronata | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 4 | |
| u | 2 | 10.0% |
| o | 2 | 10.0% |
| G | 1 | 5.0% |
| d | 1 | 5.0% |
| l | 1 | 5.0% |
| p | 1 | 5.0% |
| e | 1 | 5.0% |
| (space) | 1 | 5.0% |
| I | 1 | 5.0% |
| Other values (5) | 5 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 16 | |
| Uppercase Letter | 2 | 10.0% |
| Space Separator | 1 | 5.0% |
| Other Punctuation | 1 | 5.0% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 4 | |
| u | 2 | |
| o | 2 | |
| d | 1 | 6.2% |
| l | 1 | 6.2% |
| p | 1 | 6.2% |
| e | 1 | 6.2% |
| c | 1 | 6.2% |
| r | 1 | 6.2% |
| n | 1 | 6.2% |
Uppercase Letter
| Value | Count | Frequency (%) |
| G | 1 | |
| I | 1 |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 1 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 18 | |
| Common | 2 | 10.0% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 4 | |
| u | 2 | |
| o | 2 | |
| G | 1 | 5.6% |
| d | 1 | 5.6% |
| l | 1 | 5.6% |
| p | 1 | 5.6% |
| e | 1 | 5.6% |
| I | 1 | 5.6% |
| c | 1 | 5.6% |
| Other values (3) | 3 |
Common
| Value | Count | Frequency (%) |
| (space) | 1 | |
| . | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 20 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 4 | |
| u | 2 | 10.0% |
| o | 2 | 10.0% |
| G | 1 | 5.0% |
| d | 1 | 5.0% |
| l | 1 | 5.0% |
| p | 1 | 5.0% |
| e | 1 | 5.0% |
| (space) | 1 | 5.0% |
| I | 1 | 5.0% |
| Other values (5) | 5 |
bed
Text
Missing 
| Distinct | 6 |
|---|---|
| Distinct (%) | 85.7% |
| Missing | 2361466 |
| Missing (%) | > 99.9% |
| Memory size | 18.0 MiB |
Length
| Max length | 49 |
|---|---|
| Median length | 10 |
| Mean length | 12.85714286 |
| Min length | 2 |
Unique
| Unique | 5 |
|---|---|
| Unique (%) | 71.4% |
Sample
| 1st row | Ping-lin |
|---|---|
| 2nd row | Ada |
| 3rd row | Seldovia |
| 4th row | Woods Hole |
| 5th row | MX |
| Value | Count | Frequency (%) |
| woods | 2 | |
| hole | 2 | |
| ping-lin | 1 | |
| ada | 1 | |
| seldovia | 1 | |
| mx | 1 | |
| chiricahua | 1 | |
| mountains | 1 | |
| barfoot | 1 | |
| park | 1 | |
| Other values (2) | 2 |
Most occurring characters
| Value | Count | Frequency (%) |
| o | 12 | 13.3% |
| a | 7 | 7.8% |
| (space) | 7 | 7.8% |
| l | 6 | 6.7% |
| i | 6 | 6.7% |
| n | 6 | 6.7% |
| s | 5 | 5.6% |
| d | 4 | 4.4% |
| r | 3 | 3.3% |
| e | 3 | 3.3% |
| Other values (21) | 31 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 66 | |
| Uppercase Letter | 13 | 14.4% |
| Space Separator | 7 | 7.8% |
| Other Punctuation | 3 | 3.3% |
| Dash Punctuation | 1 | 1.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| o | 12 | |
| a | 7 | |
| l | 6 | |
| i | 6 | |
| n | 6 | |
| s | 5 | |
| d | 4 | 6.1% |
| r | 3 | 4.5% |
| e | 3 | 4.5% |
| t | 3 | 4.5% |
| Other values (8) | 11 |
Uppercase Letter
| Value | Count | Frequency (%) |
| M | 2 | |
| W | 2 | |
| P | 2 | |
| H | 2 | |
| B | 1 | |
| S | 1 | |
| C | 1 | |
| X | 1 | |
| A | 1 |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 2 | |
| . | 1 |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 7 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 79 | |
| Common | 11 | 12.2% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| o | 12 | |
| a | 7 | 8.9% |
| l | 6 | 7.6% |
| i | 6 | 7.6% |
| n | 6 | 7.6% |
| s | 5 | 6.3% |
| d | 4 | 5.1% |
| r | 3 | 3.8% |
| e | 3 | 3.8% |
| t | 3 | 3.8% |
| Other values (17) | 24 |
Common
| Value | Count | Frequency (%) |
| (space) | 7 | |
| , | 2 | 18.2% |
| - | 1 | 9.1% |
| . | 1 | 9.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 90 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| o | 12 | 13.3% |
| a | 7 | 7.8% |
| (space) | 7 | 7.8% |
| l | 6 | 6.7% |
| i | 6 | 6.7% |
| n | 6 | 6.7% |
| s | 5 | 5.6% |
| d | 4 | 4.4% |
| r | 3 | 3.3% |
| e | 3 | 3.3% |
| Other values (21) | 31 |
identificationID
Text
Constant  Missing 
| Distinct | 1 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 2361472 |
| Missing (%) | > 99.9% |
| Memory size | 18.0 MiB |
Length
| Max length | 21 |
|---|---|
| Median length | 21 |
| Mean length | 21 |
| Min length | 21 |
Unique
| Unique | 1 |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | Baja California Norte |
|---|---|
| Value | Count | Frequency (%) |
| baja | 1 | |
| california | 1 | |
| norte | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 4 | |
| (space) | 2 | |
| i | 2 | |
| o | 2 | |
| r | 2 | |
| B | 1 | 4.8% |
| j | 1 | 4.8% |
| C | 1 | 4.8% |
| l | 1 | 4.8% |
| f | 1 | 4.8% |
| Other values (4) | 4 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 16 | |
| Uppercase Letter | 3 | 14.3% |
| Space Separator | 2 | 9.5% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 4 | |
| i | 2 | |
| o | 2 | |
| r | 2 | |
| j | 1 | 6.2% |
| l | 1 | 6.2% |
| f | 1 | 6.2% |
| n | 1 | 6.2% |
| t | 1 | 6.2% |
| e | 1 | 6.2% |
Uppercase Letter
| Value | Count | Frequency (%) |
| B | 1 | |
| C | 1 | |
| N | 1 |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 2 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 19 | |
| Common | 2 | 9.5% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 4 | |
| i | 2 | |
| o | 2 | |
| r | 2 | |
| B | 1 | 5.3% |
| j | 1 | 5.3% |
| C | 1 | 5.3% |
| l | 1 | 5.3% |
| f | 1 | 5.3% |
| n | 1 | 5.3% |
| Other values (3) | 3 |
Common
| Value | Count | Frequency (%) |
| (space) | 2 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 21 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 4 | |
| (space) | 2 | |
| i | 2 | |
| o | 2 | |
| r | 2 | |
| B | 1 | 4.8% |
| j | 1 | 4.8% |
| C | 1 | 4.8% |
| l | 1 | 4.8% |
| f | 1 | 4.8% |
| Other values (4) | 4 |
Missing 
| Distinct | 3 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 2361470 |
| Missing (%) | > 99.9% |
| Memory size | 18.0 MiB |
Length
| Max length | 7 |
|---|---|
| Median length | 7 |
| Mean length | 6.333333333 |
| Min length | 5 |
Unique
| Unique | 3 |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | GENUS |
|---|---|
| 2nd row | 3155772 |
| 3rd row | SPECIES |
| Value | Count | Frequency (%) |
| genus | 1 | |
| 3155772 | 1 | |
| species | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| E | 3 | |
| S | 3 | |
| 5 | 2 | |
| 7 | 2 | |
| G | 1 | 5.3% |
| N | 1 | 5.3% |
| U | 1 | 5.3% |
| 3 | 1 | 5.3% |
| 1 | 1 | 5.3% |
| 2 | 1 | 5.3% |
| Other values (3) | 3 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 12 | |
| Decimal Number | 7 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| E | 3 | |
| S | 3 | |
| G | 1 | 8.3% |
| N | 1 | 8.3% |
| U | 1 | 8.3% |
| P | 1 | 8.3% |
| C | 1 | 8.3% |
| I | 1 | 8.3% |
Decimal Number
| Value | Count | Frequency (%) |
| 5 | 2 | |
| 7 | 2 | |
| 3 | 1 | |
| 1 | 1 | |
| 2 | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 12 | |
| Common | 7 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| E | 3 | |
| S | 3 | |
| G | 1 | 8.3% |
| N | 1 | 8.3% |
| U | 1 | 8.3% |
| P | 1 | 8.3% |
| C | 1 | 8.3% |
| I | 1 | 8.3% |
Common
| Value | Count | Frequency (%) |
| 5 | 2 | |
| 7 | 2 | |
| 3 | 1 | |
| 1 | 1 | |
| 2 | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 19 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| E | 3 | |
| S | 3 | |
| 5 | 2 | |
| 7 | 2 | |
| G | 1 | 5.3% |
| N | 1 | 5.3% |
| U | 1 | 5.3% |
| 3 | 1 | 5.3% |
| 1 | 1 | 5.3% |
| 2 | 1 | 5.3% |
| Other values (3) | 3 |
Missing 
| Distinct | 24 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 2352474 |
| Missing (%) | 99.6% |
| Memory size | 18.0 MiB |
Length
| Max length | 64 |
|---|---|
| Median length | 3 |
| Mean length | 4.292810312 |
| Min length | 2 |
Unique
| Unique | 4 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | near |
|---|---|
| 2nd row | cf. |
| 3rd row | cf. |
| 4th row | vel aff. |
| 5th row | vel aff. |
| Value | Count | Frequency (%) |
| cf | 6018 | |
| uncertain | 1593 | 17.4% |
| aff | 939 | 10.3% |
| near | 259 | 2.8% |
| s.l | 121 | 1.3% |
| vel | 93 | 1.0% |
| group | 24 | 0.3% |
| subgroup | 23 | 0.3% |
| sp | 21 | 0.2% |
| nov | 15 | 0.2% |
| Other values (12) | 29 | 0.3% |
Most occurring characters
| Value | Count | Frequency (%) |
| f | 7896 | |
| c | 7622 | |
| . | 7199 | |
| n | 3471 | |
| a | 2802 | 7.3% |
| e | 1963 | 5.1% |
| r | 1904 | 4.9% |
| u | 1624 | 4.2% |
| t | 1602 | 4.1% |
| i | 1598 | 4.1% |
| Other values (19) | 950 | 2.5% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 31242 | |
| Other Punctuation | 7203 | 18.6% |
| Space Separator | 136 | 0.4% |
| Uppercase Letter | 50 | 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| f | 7896 | |
| c | 7622 | |
| n | 3471 | |
| a | 2802 | 9.0% |
| e | 1963 | 6.3% |
| r | 1904 | 6.1% |
| u | 1624 | 5.2% |
| t | 1602 | 5.1% |
| i | 1598 | 5.1% |
| l | 225 | 0.7% |
| Other values (10) | 535 | 1.7% |
Uppercase Letter
| Value | Count | Frequency (%) |
| U | 44 | |
| C | 2 | 4.0% |
| B | 1 | 2.0% |
| P | 1 | 2.0% |
| D | 1 | 2.0% |
| A | 1 | 2.0% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 7199 | |
| , | 4 | 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 136 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 31292 | |
| Common | 7339 | 19.0% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| f | 7896 | |
| c | 7622 | |
| n | 3471 | |
| a | 2802 | 9.0% |
| e | 1963 | 6.3% |
| r | 1904 | 6.1% |
| u | 1624 | 5.2% |
| t | 1602 | 5.1% |
| i | 1598 | 5.1% |
| l | 225 | 0.7% |
| Other values (16) | 585 | 1.9% |
Common
| Value | Count | Frequency (%) |
| . | 7199 | |
| (space) | 136 | 1.9% |
| , | 4 | 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 38631 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| f | 7896 | |
| c | 7622 | |
| . | 7199 | |
| n | 3471 | |
| a | 2802 | 7.3% |
| e | 1963 | 5.1% |
| r | 1904 | 4.9% |
| u | 1624 | 4.2% |
| t | 1602 | 4.1% |
| i | 1598 | 4.1% |
| Other values (19) | 950 | 2.5% |
typeStatus
Text
Missing 
| Distinct | 20 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 2274525 |
| Missing (%) | 96.3% |
| Memory size | 18.0 MiB |
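The percentages in this table follow from the missing count and the 2361473 observations stated in the report overview. The sketch below redoes that arithmetic. The `display_pct` rule (showing "< 0.1%" or "> 99.9%" for values that would otherwise round to 0.0% or 100.0%) is an assumption inferred from how the figures appear in this report, not a documented behavior of the profiler, and both helper names are hypothetical.

```python
TOTAL_ROWS = 2_361_473  # "Number of observations" from the report overview

def pct(part, whole):
    """Return part as a percentage of whole."""
    return 100 * part / whole

def display_pct(value):
    """Format a percentage the way this report appears to (assumed rule)."""
    if 0 < value < 0.1:
        return "< 0.1%"
    if 99.9 < value < 100:
        return "> 99.9%"
    return f"{value:.1f}%"

missing = 2_274_525              # "Missing" for typeStatus
present = TOTAL_ROWS - missing   # non-missing records
distinct = 20                    # "Distinct" for typeStatus

missing_label = display_pct(pct(missing, TOTAL_ROWS))
distinct_label = display_pct(pct(distinct, present))

print(f"present: {present}")
print(f"Missing (%): {missing_label}")
print(f"Distinct (%): {distinct_label}")
```

Note that Distinct (%) is taken relative to the 86948 non-missing records rather than to all rows; either base gives "< 0.1%" here.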
Length
| Max length | 34 |
|---|---|
| Median length | 8 |
| Mean length | 7.268436307 |
| Min length | 4 |
Unique
| Unique | 2 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | TYPE |
|---|---|
| 2nd row | HOLOTYPE |
| 3rd row | TYPE |
| 4th row | HOLOTYPE |
| 5th row | HOLOTYPE |
| Value | Count | Frequency (%) |
| holotype | 26480 | |
| paratype | 19158 | |
| isotype | 15389 | |
| type | 12139 | |
| syntype | 7658 | 8.8% |
| lectotype | 1798 | 2.1% |
| isosyntype | 1550 | 1.8% |
| allotype | 997 | 1.1% |
| isolectotype | 518 | 0.6% |
| cotype | 484 | 0.6% |
| Other values (13) | 780 | 0.9% |
Most occurring characters
| Value | Count | Frequency (%) |
| P | 106503 | |
| Y | 96139 | |
| E | 89952 | |
| T | 89665 | |
| O | 75084 | |
| A | 40178 | 6.4% |
| L | 31157 | 4.9% |
| S | 26767 | 4.2% |
| H | 26546 | 4.2% |
| R | 19532 | 3.1% |
| Other values (23) | 30453 | 4.8% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 631940 | |
| Lowercase Letter | 31 | < 0.1% |
| Space Separator | 3 | < 0.1% |
| Other Punctuation | 2 | < 0.1% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| P | 106503 | |
| Y | 96139 | |
| E | 89952 | |
| T | 89665 | |
| O | 75084 | |
| A | 40178 | 6.4% |
| L | 31157 | 4.9% |
| S | 26767 | 4.2% |
| H | 26546 | 4.2% |
| R | 19532 | 3.1% |
| Other values (6) | 30417 | 4.8% |
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 9 | |
| l | 4 | |
| n | 3 | 9.7% |
| e | 2 | 6.5% |
| u | 2 | 6.5% |
| d | 2 | 6.5% |
| i | 2 | 6.5% |
| j | 1 | 3.2% |
| r | 1 | 3.2% |
| o | 1 | 3.2% |
| Other values (4) | 4 |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 1 | |
| . | 1 |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 3 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 631971 | |
| Common | 5 | < 0.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| P | 106503 | |
| Y | 96139 | |
| E | 89952 | |
| T | 89665 | |
| O | 75084 | |
| A | 40178 | 6.4% |
| L | 31157 | 4.9% |
| S | 26767 | 4.2% |
| H | 26546 | 4.2% |
| R | 19532 | 3.1% |
| Other values (20) | 30448 | 4.8% |
Common
| Value | Count | Frequency (%) |
| (space) | 3 | |
| , | 1 | 20.0% |
| . | 1 | 20.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 631976 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| P | 106503 | |
| Y | 96139 | |
| E | 89952 | |
| T | 89665 | |
| O | 75084 | |
| A | 40178 | 6.4% |
| L | 31157 | 4.9% |
| S | 26767 | 4.2% |
| H | 26546 | 4.2% |
| R | 19532 | 3.1% |
| Other values (23) | 30453 | 4.8% |
identifiedBy
Text
Missing 
| Distinct | 15491 |
|---|---|
| Distinct (%) | 3.8% |
| Missing | 1955406 |
| Missing (%) | 82.8% |
| Memory size | 18.0 MiB |
Length
| Max length | 215 |
|---|---|
| Median length | 136 |
| Mean length | 36.86477601 |
| Min length | 2 |
Unique
| Unique | 5875 |
|---|---|
| Unique (%) | 1.4% |
Sample
| 1st row | Badley, J. E. |
|---|---|
| 2nd row | Strong, M. T., (US), Smithsonian Institution - National Museum of Natural History (UNITED STATES) |
| 3rd row | Johnson, M. W. |
| 4th row | Zibrowius, Helmut, (CNRS-UA 41), Centre d'Oceanologie de Marseille (CNRS-UA 41) (FRANCE) |
| 5th row | Foster, W. D. |
| Value | Count | Frequency (%) |
| of | 101962 | 4.6% |
| museum | 88049 | 3.9% |
| national | 87412 | 3.9% |
| institution | 84694 | 3.8% |
| smithsonian | 84068 | 3.8% |
| natural | 83876 | 3.8% |
| history | 83747 | 3.7% |
| united | 76183 | 3.4% |
| states | 75967 | 3.4% |
| 60502 | 2.7% | |
| Other values (11543) | 1407833 |
Most occurring characters
| Value | Count | Frequency (%) |
| (space) | 1828226 | 12.2% |
| a | 896479 | 6.0% |
| t | 890147 | 5.9% |
| i | 878253 | 5.9% |
| n | 819247 | 5.5% |
| o | 804702 | 5.4% |
| e | 660704 | 4.4% |
| , | 639172 | 4.3% |
| r | 634090 | 4.2% |
| s | 583168 | 3.9% |
| Other values (97) | 6335381 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 8553794 | |
| Uppercase Letter | 3024063 | 20.2% |
| Space Separator | 1828226 | 12.2% |
| Other Punctuation | 1178772 | 7.9% |
| Open Punctuation | 155751 | 1.0% |
| Close Punctuation | 155751 | 1.0% |
| Dash Punctuation | 71705 | 0.5% |
| Decimal Number | 1459 | < 0.1% |
| Math Symbol | 25 | < 0.1% |
| Initial Punctuation | 11 | < 0.1% |
| Other values (2) | 12 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 896479 | |
| t | 890147 | |
| i | 878253 | |
| n | 819247 | |
| o | 804702 | |
| e | 660704 | |
| r | 634090 | |
| s | 583168 | 6.8% |
| u | 486436 | 5.7% |
| l | 429180 | 5.0% |
| Other values (41) | 1471388 |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 355379 | |
| T | 298710 | 9.9% |
| N | 288522 | 9.5% |
| E | 221718 | 7.3% |
| M | 209359 | 6.9% |
| I | 204015 | 6.7% |
| A | 174366 | 5.8% |
| H | 172820 | 5.7% |
| D | 153344 | 5.1% |
| U | 121907 | 4.0% |
| Other values (20) | 823923 |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 639172 | |
| . | 508849 | |
| ; | 23149 | 2.0% |
| / | 4267 | 0.4% |
| & | 1664 | 0.1% |
| ' | 1327 | 0.1% |
| " | 332 | < 0.1% |
| ¡ | 8 | < 0.1% |
| ? | 4 | < 0.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 673 | |
| 4 | 672 | |
| 2 | 45 | 3.1% |
| 9 | 24 | 1.6% |
| 0 | 23 | 1.6% |
| 6 | 22 | 1.5% |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 155163 | |
| [ | 588 | 0.4% |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 155163 | |
| ] | 588 | 0.4% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 71702 | |
| – | 3 | < 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 1828226 |
Math Symbol
| Value | Count | Frequency (%) |
| + | 25 |
Initial Punctuation
| Value | Count | Frequency (%) |
| “ | 11 |
Final Punctuation
| Value | Count | Frequency (%) |
| ” | 11 |
Currency Symbol
| Value | Count | Frequency (%) |
| ¢ | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 11577857 | |
| Common | 3391712 | 22.7% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 896479 | 7.7% |
| t | 890147 | 7.7% |
| i | 878253 | 7.6% |
| n | 819247 | 7.1% |
| o | 804702 | 7.0% |
| e | 660704 | 5.7% |
| r | 634090 | 5.5% |
| s | 583168 | 5.0% |
| u | 486436 | 4.2% |
| l | 429180 | 3.7% |
| Other values (71) | 4495451 |
Common
| Value | Count | Frequency (%) |
| (space) | 1828226 | |
| , | 639172 | 18.8% |
| . | 508849 | 15.0% |
| ( | 155163 | 4.6% |
| ) | 155163 | 4.6% |
| - | 71702 | 2.1% |
| ; | 23149 | 0.7% |
| / | 4267 | 0.1% |
| & | 1664 | < 0.1% |
| ' | 1327 | < 0.1% |
| Other values (16) | 3030 | 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 14959867 | |
| None | 9677 | 0.1% |
| Punctuation | 25 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| (space) | 1828226 | 12.2% |
| a | 896479 | 6.0% |
| t | 890147 | 6.0% |
| i | 878253 | 5.9% |
| n | 819247 | 5.5% |
| o | 804702 | 5.4% |
| e | 660704 | 4.4% |
| , | 639172 | 4.3% |
| r | 634090 | 4.2% |
| s | 583168 | 3.9% |
| Other values (63) | 6325679 |
None
| Value | Count | Frequency (%) |
| í | 5003 | |
| é | 1178 | 12.2% |
| á | 1173 | 12.1% |
| ñ | 499 | 5.2% |
| ö | 450 | 4.7% |
| ü | 301 | 3.1% |
| ó | 273 | 2.8% |
| ä | 216 | 2.2% |
| ã | 195 | 2.0% |
| ú | 82 | 0.8% |
| Other values (21) | 307 | 3.2% |
Punctuation
| Value | Count | Frequency (%) |
| “ | 11 | |
| ” | 11 | |
| – | 3 | 12.0% |
identifiedByID
Text
Missing 
| Distinct | 2 |
|---|---|
| Distinct (%) | 66.7% |
| Missing | 2361470 |
| Missing (%) | > 99.9% |
| Memory size | 18.0 MiB |
Length
| Max length | 13 |
|---|---|
| Median length | 8 |
| Mean length | 9.666666667 |
| Min length | 8 |
Unique
| Unique | 1 |
|---|---|
| Unique (%) | 33.3% |
Sample
| 1st row | ACCEPTED |
|---|---|
| 2nd row | Magnoliopsida |
| 3rd row | ACCEPTED |
| Value | Count | Frequency (%) |
| accepted | 2 | |
| magnoliopsida | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| C | 4 | |
| E | 4 | |
| A | 2 | 6.9% |
| P | 2 | 6.9% |
| T | 2 | 6.9% |
| D | 2 | 6.9% |
| a | 2 | 6.9% |
| o | 2 | 6.9% |
| i | 2 | 6.9% |
| M | 1 | 3.4% |
| Other values (6) | 6 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 17 | |
| Lowercase Letter | 12 |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 2 | |
| o | 2 | |
| i | 2 | |
| g | 1 | |
| n | 1 | |
| l | 1 | |
| p | 1 | |
| s | 1 | |
| d | 1 |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 4 | |
| E | 4 | |
| A | 2 | |
| P | 2 | |
| T | 2 | |
| D | 2 | |
| M | 1 | 5.9% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 29 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| C | 4 | |
| E | 4 | |
| A | 2 | 6.9% |
| P | 2 | 6.9% |
| T | 2 | 6.9% |
| D | 2 | 6.9% |
| a | 2 | 6.9% |
| o | 2 | 6.9% |
| i | 2 | 6.9% |
| M | 1 | 3.4% |
| Other values (6) | 6 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 29 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| C | 4 | |
| E | 4 | |
| A | 2 | 6.9% |
| P | 2 | 6.9% |
| T | 2 | 6.9% |
| D | 2 | 6.9% |
| a | 2 | 6.9% |
| o | 2 | 6.9% |
| i | 2 | 6.9% |
| M | 1 | 3.4% |
| Other values (6) | 6 |
dateIdentified
Text
Constant  Missing 
| Distinct | 1 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 2361472 |
| Missing (%) | > 99.9% |
| Memory size | 18.0 MiB |
Length
| Max length | 9 |
|---|---|
| Median length | 9 |
| Mean length | 9 |
| Min length | 9 |
Unique
| Unique | 1 |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | Asterales |
|---|---|
| Value | Count | Frequency (%) |
| asterales | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| s | 2 | |
| e | 2 | |
| A | 1 | |
| t | 1 | |
| r | 1 | |
| a | 1 | |
| l | 1 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 8 | |
| Uppercase Letter | 1 | 11.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| s | 2 | |
| e | 2 | |
| t | 1 | |
| r | 1 | |
| a | 1 | |
| l | 1 |
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 9 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| s | 2 | |
| e | 2 | |
| A | 1 | |
| t | 1 | |
| r | 1 | |
| a | 1 | |
| l | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 9 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| s | 2 | |
| e | 2 | |
| A | 1 | |
| t | 1 | |
| r | 1 | |
| a | 1 | |
| l | 1 |
identificationReferences
Text
Constant  Missing 
| Distinct | 1 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 2361472 |
| Missing (%) | > 99.9% |
| Memory size | 18.0 MiB |
Length
| Max length | 37 |
|---|---|
| Median length | 37 |
| Mean length | 37 |
| Min length | 37 |
Unique
| Unique | 1 |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | Guatteria punctata (Aubl.) R.A.Howard |
|---|---|
| Value | Count | Frequency (%) |
| guatteria | 1 | |
| punctata | 1 | |
| aubl | 1 | |
| r.a.howard | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 5 | |
| t | 4 | 10.8% |
| (space) | 3 | 8.1% |
| u | 3 | 8.1% |
| . | 3 | 8.1% |
| r | 2 | 5.4% |
| A | 2 | 5.4% |
| G | 1 | 2.7% |
| l | 1 | 2.7% |
| w | 1 | 2.7% |
| Other values (12) | 12 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 24 | |
| Uppercase Letter | 5 | 13.5% |
| Space Separator | 3 | 8.1% |
| Other Punctuation | 3 | 8.1% |
| Close Punctuation | 1 | 2.7% |
| Open Punctuation | 1 | 2.7% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 5 | |
| t | 4 | |
| u | 3 | |
| r | 2 | 8.3% |
| l | 1 | 4.2% |
| w | 1 | 4.2% |
| o | 1 | 4.2% |
| b | 1 | 4.2% |
| c | 1 | 4.2% |
| n | 1 | 4.2% |
| Other values (4) | 4 |
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 2 | |
| G | 1 | |
| H | 1 | |
| R | 1 |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 3 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 3 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 1 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 29 | |
| Common | 8 | 21.6% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 5 | |
| t | 4 | |
| u | 3 | 10.3% |
| r | 2 | 6.9% |
| A | 2 | 6.9% |
| G | 1 | 3.4% |
| l | 1 | 3.4% |
| w | 1 | 3.4% |
| o | 1 | 3.4% |
| H | 1 | 3.4% |
| Other values (8) | 8 |
Common
| Value | Count | Frequency (%) |
| (space) | 3 | |
| . | 3 | |
| ) | 1 | 12.5% |
| ( | 1 | 12.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 37 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 5 | |
| t | 4 | 10.8% |
| (space) | 3 | 8.1% |
| u | 3 | 8.1% |
| . | 3 | 8.1% |
| r | 2 | 5.4% |
| A | 2 | 5.4% |
| G | 1 | 2.7% |
| l | 1 | 2.7% |
| w | 1 | 2.7% |
| Other values (12) | 12 |
identificationVerificationStatus
Text
Missing 
| Distinct | 6 |
|---|---|
| Distinct (%) | 85.7% |
| Missing | 2361466 |
| Missing (%) | > 99.9% |
| Memory size | 18.0 MiB |
Length
| Max length | 36 |
|---|---|
| Median length | 7 |
| Mean length | 16.14285714 |
| Min length | 7 |
Unique
| Unique | 5 |
|---|---|
| Unique (%) | 71.4% |
Sample
| 1st row | 821cc27a-e3bb-4bc5-ac34-89ada245069d |
|---|---|
| 2nd row | 24.9115 |
| 3rd row | 34.7745 |
| 4th row | Campanulaceae |
| 5th row | 59.4381 |
| Value | Count | Frequency (%) |
| 821cc27a-e3bb-4bc5-ac34-89ada245069d | 2 | |
| 24.9115 | 1 | |
| 34.7745 | 1 | |
| campanulaceae | 1 | |
| 59.4381 | 1 | |
| 41.5265 | 1 |
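The Distinct, Unique, and Length figures for this column follow from its seven non-missing values. The sketch below assumes those values are exactly the ones listed in the frequency table above, with the capitalization shown in the sample rows. It is an illustration of the arithmetic, not the profiler's implementation.

```python
from collections import Counter
from statistics import mean, median

# Assumed non-missing values, taken from the frequency table above.
values = [
    "821cc27a-e3bb-4bc5-ac34-89ada245069d",
    "821cc27a-e3bb-4bc5-ac34-89ada245069d",
    "24.9115",
    "34.7745",
    "Campanulaceae",
    "59.4381",
    "41.5265",
]

counts = Counter(values)
n = len(values)

distinct = len(counts)                                # different values
unique = sum(1 for c in counts.values() if c == 1)    # values seen exactly once

distinct_pct = round(100 * distinct / n, 1)
unique_pct = round(100 * unique / n, 1)

lengths = [len(v) for v in values]
print(distinct, distinct_pct, unique, unique_pct)
print(max(lengths), median(lengths), mean(lengths), min(lengths))
```

Note that the percentages are taken over the non-missing values only, which is why a column with 2361466 missing cells can still report 85.7% distinct.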
Most occurring characters
| Value | Count | Frequency (%) |
| a | 12 | 10.6% |
| 4 | 11 | 9.7% |
| 5 | 9 | 8.0% |
| c | 9 | 8.0% |
| - | 8 | 7.1% |
| 2 | 8 | 7.1% |
| 1 | 6 | 5.3% |
| 9 | 6 | 5.3% |
| 3 | 6 | 5.3% |
| b | 6 | 5.3% |
| Other values (13) | 32 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 60 | |
| Lowercase Letter | 40 | |
| Dash Punctuation | 8 | 7.1% |
| Other Punctuation | 4 | 3.5% |
| Uppercase Letter | 1 | 0.9% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 12 | |
| c | 9 | |
| b | 6 | |
| e | 4 | 10.0% |
| d | 4 | 10.0% |
| m | 1 | 2.5% |
| p | 1 | 2.5% |
| n | 1 | 2.5% |
| u | 1 | 2.5% |
| l | 1 | 2.5% |
Decimal Number
| Value | Count | Frequency (%) |
| 4 | 11 | |
| 5 | 9 | |
| 2 | 8 | |
| 1 | 6 | |
| 9 | 6 | |
| 3 | 6 | |
| 8 | 5 | |
| 7 | 4 | 6.7% |
| 6 | 3 | 5.0% |
| 0 | 2 | 3.3% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 8 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 4 |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 72 | |
| Latin | 41 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 4 | 11 | |
| 5 | 9 | |
| - | 8 | |
| 2 | 8 | |
| 1 | 6 | |
| 9 | 6 | |
| 3 | 6 | |
| 8 | 5 | |
| 7 | 4 | 5.6% |
| . | 4 | 5.6% |
| Other values (2) | 5 |
Latin
| Value | Count | Frequency (%) |
| a | 12 | |
| c | 9 | |
| b | 6 | |
| e | 4 | 9.8% |
| d | 4 | 9.8% |
| C | 1 | 2.4% |
| m | 1 | 2.4% |
| p | 1 | 2.4% |
| n | 1 | 2.4% |
| u | 1 | 2.4% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 113 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 12 | 10.6% |
| 4 | 11 | 9.7% |
| 5 | 9 | 8.0% |
| c | 9 | 8.0% |
| - | 8 | 7.1% |
| 2 | 8 | 7.1% |
| 1 | 6 | 5.3% |
| 9 | 6 | 5.3% |
| 3 | 6 | 5.3% |
| b | 6 | 5.3% |
| Other values (13) | 32 |
Missing 
| Distinct | 5 |
|---|---|
| Distinct (%) | 83.3% |
| Missing | 2361467 |
| Missing (%) | > 99.9% |
| Memory size | 18.0 MiB |
Length
| Max length | 8 |
|---|---|
| Median length | 7 |
| Mean length | 5.666666667 |
| Min length | 2 |
Unique
| Unique | 4 |
|---|---|
| Unique (%) | 66.7% |
Sample
| 1st row | US |
|---|---|
| 2nd row | 121.73 |
| 3rd row | -96.6783 |
| 4th row | -151.711 |
| 5th row | -70.6731 |
| Value | Count | Frequency (%) |
| us | 2 | |
| 121.73 | 1 | |
| 96.6783 | 1 | |
| 151.711 | 1 | |
| 70.6731 | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 7 | |
| 7 | 5 | |
| . | 4 | |
| 3 | 3 | |
| - | 3 | |
| 6 | 3 | |
| U | 2 | 5.9% |
| S | 2 | 5.9% |
| 2 | 1 | 2.9% |
| 9 | 1 | 2.9% |
| Other values (3) | 3 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 23 | |
| Other Punctuation | 4 | 11.8% |
| Uppercase Letter | 4 | 11.8% |
| Dash Punctuation | 3 | 8.8% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 7 | |
| 7 | 5 | |
| 3 | 3 | |
| 6 | 3 | |
| 2 | 1 | 4.3% |
| 9 | 1 | 4.3% |
| 8 | 1 | 4.3% |
| 5 | 1 | 4.3% |
| 0 | 1 | 4.3% |
Uppercase Letter
| Value | Count | Frequency (%) |
| U | 2 | |
| S | 2 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 4 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 3 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 30 | |
| Latin | 4 | 11.8% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 7 | |
| 7 | 5 | |
| . | 4 | |
| 3 | 3 | |
| - | 3 | |
| 6 | 3 | |
| 2 | 1 | 3.3% |
| 9 | 1 | 3.3% |
| 8 | 1 | 3.3% |
| 5 | 1 | 3.3% |
Latin
| Value | Count | Frequency (%) |
| U | 2 | |
| S | 2 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 34 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 7 | |
| 7 | 5 | |
| . | 4 | |
| 3 | 3 | |
| - | 3 | |
| 6 | 3 | |
| U | 2 | 5.9% |
| S | 2 | 5.9% |
| 2 | 1 | 2.9% |
| 9 | 1 | 2.9% |
| Other values (3) | 3 |
taxonID
Text
Missing 
| Distinct | 2 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 2361471 |
| Missing (%) | > 99.9% |
| Memory size | 18.0 MiB |
Length
| Max length | 24 |
|---|---|
| Median length | 24 |
| Mean length | 24 |
| Min length | 24 |
Unique
| Unique | 2 |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | 2024-12-02T13:57:05.005Z |
|---|---|
| 2nd row | 2024-12-02T13:57:45.829Z |
| Value | Count | Frequency (%) |
| 2024-12-02t13:57:05.005z | 1 | |
| 2024-12-02t13:57:45.829z | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| 2 | 9 | |
| 0 | 7 | |
| 5 | 5 | |
| - | 4 | |
| 1 | 4 | |
| : | 4 | |
| 4 | 3 | 6.2% |
| T | 2 | 4.2% |
| 3 | 2 | 4.2% |
| 7 | 2 | 4.2% |
| Other values (4) | 6 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 34 | |
| Other Punctuation | 6 | 12.5% |
| Dash Punctuation | 4 | 8.3% |
| Uppercase Letter | 4 | 8.3% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 9 | |
| 0 | 7 | |
| 5 | 5 | |
| 1 | 4 | |
| 4 | 3 | 8.8% |
| 3 | 2 | 5.9% |
| 7 | 2 | 5.9% |
| 8 | 1 | 2.9% |
| 9 | 1 | 2.9% |
Other Punctuation
| Value | Count | Frequency (%) |
| : | 4 | |
| . | 2 |
Uppercase Letter
| Value | Count | Frequency (%) |
| T | 2 | |
| Z | 2 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 4 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 44 | |
| Latin | 4 | 8.3% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 2 | 9 | |
| 0 | 7 | |
| 5 | 5 | |
| - | 4 | |
| 1 | 4 | |
| : | 4 | |
| 4 | 3 | 6.8% |
| 3 | 2 | 4.5% |
| 7 | 2 | 4.5% |
| . | 2 | 4.5% |
| Other values (2) | 2 | 4.5% |
Latin
| Value | Count | Frequency (%) |
| T | 2 | |
| Z | 2 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 48 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 2 | 9 | |
| 0 | 7 | |
| 5 | 5 | |
| - | 4 | |
| 1 | 4 | |
| : | 4 | |
| 4 | 3 | 6.2% |
| T | 2 | 4.2% |
| 3 | 2 | 4.2% |
| 7 | 2 | 4.2% |
| Other values (4) | 6 |
scientificNameID
Text
Constant  Missing 
| Distinct | 1 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 2361472 |
| Missing (%) | > 99.9% |
| Memory size | 18.0 MiB |
Length
| Max length | 4 |
|---|---|
| Median length | 4 |
| Mean length | 4 |
| Min length | 4 |
Unique
| Unique | 1 |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | 69.0 |
|---|
| Value | Count | Frequency (%) |
| 69.0 | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| 6 | 1 | |
| 9 | 1 | |
| . | 1 | |
| 0 | 1 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 3 | |
| Other Punctuation | 1 | 25.0% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 6 | 1 | |
| 9 | 1 | |
| 0 | 1 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 4 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 6 | 1 | |
| 9 | 1 | |
| . | 1 | |
| 0 | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 4 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 6 | 1 | |
| 9 | 1 | |
| . | 1 | |
| 0 | 1 |
| Distinct | 315017 |
|---|---|
| Distinct (%) | 13.4% |
| Missing | 5774 |
| Missing (%) | 0.2% |
| Memory size | 18.0 MiB |
Length
| Max length | 9 |
|---|---|
| Median length | 7 |
| Mean length | 6.879701948 |
| Min length | 1 |
Unique
| Unique | 142213 |
|---|---|
| Unique (%) | 6.0% |
Sample
| 1st row | 3869 |
|---|---|
| 2nd row | 3044413 |
| 3rd row | 2431199 |
| 4th row | 714 |
| 5th row | 2322812 |
| Value | Count | Frequency (%) |
| 2431491 | 19390 | 0.8% |
| 225 | 6083 | 0.3% |
| 7947184 | 4743 | 0.2% |
| 5967481 | 3865 | 0.2% |
| 2437967 | 3815 | 0.2% |
| 2431539 | 3260 | 0.1% |
| 2440447 | 2987 | 0.1% |
| 105 | 2810 | 0.1% |
| 1340278 | 2739 | 0.1% |
| 2431224 | 2562 | 0.1% |
| Other values (315007) | 2303445 |
Most occurring characters
| Value | Count | Frequency (%) |
| 2 | 2439480 | |
| 3 | 1778291 | |
| 4 | 1658961 | |
| 1 | 1632514 | |
| 5 | 1571567 | |
| 7 | 1538551 | |
| 8 | 1409644 | |
| 6 | 1402728 | |
| 9 | 1400354 | |
| 0 | 1374408 | |
| Other values (7) | 9 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 16206498 | |
| Lowercase Letter | 8 | < 0.1% |
| Uppercase Letter | 1 | < 0.1% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 2439480 | |
| 3 | 1778291 | |
| 4 | 1658961 | |
| 1 | 1632514 | |
| 5 | 1571567 | |
| 7 | 1538551 | |
| 8 | 1409644 | |
| 6 | 1402728 | |
| 9 | 1400354 | |
| 0 | 1374408 |
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 3 | |
| m | 1 | 12.5% |
| p | 1 | 12.5% |
| n | 1 | 12.5% |
| u | 1 | 12.5% |
| l | 1 | 12.5% |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 16206498 | |
| Latin | 9 | < 0.1% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 2 | 2439480 | |
| 3 | 1778291 | |
| 4 | 1658961 | |
| 1 | 1632514 | |
| 5 | 1571567 | |
| 7 | 1538551 | |
| 8 | 1409644 | |
| 6 | 1402728 | |
| 9 | 1400354 | |
| 0 | 1374408 |
Latin
| Value | Count | Frequency (%) |
| a | 3 | |
| C | 1 | 11.1% |
| m | 1 | 11.1% |
| p | 1 | 11.1% |
| n | 1 | 11.1% |
| u | 1 | 11.1% |
| l | 1 | 11.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 16206507 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 2 | 2439480 | |
| 3 | 1778291 | |
| 4 | 1658961 | |
| 1 | 1632514 | |
| 5 | 1571567 | |
| 7 | 1538551 | |
| 8 | 1409644 | |
| 6 | 1402728 | |
| 9 | 1400354 | |
| 0 | 1374408 | |
| Other values (7) | 9 | < 0.1% |
parentNameUsageID
Text
Constant  Missing 
| Distinct | 1 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 2361472 |
| Missing (%) | > 99.9% |
| Memory size | 18.0 MiB |
Length
| Max length | 9 |
|---|---|
| Median length | 9 |
| Mean length | 9 |
| Min length | 9 |
Unique
| Unique | 1 |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | Campanula |
|---|
| Value | Count | Frequency (%) |
| campanula | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 3 | |
| C | 1 | 11.1% |
| m | 1 | 11.1% |
| p | 1 | 11.1% |
| n | 1 | 11.1% |
| u | 1 | 11.1% |
| l | 1 | 11.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 8 | |
| Uppercase Letter | 1 | 11.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 3 | |
| m | 1 | 12.5% |
| p | 1 | 12.5% |
| n | 1 | 12.5% |
| u | 1 | 12.5% |
| l | 1 | 12.5% |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 9 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 3 | |
| C | 1 | 11.1% |
| m | 1 | 11.1% |
| p | 1 | 11.1% |
| n | 1 | 11.1% |
| u | 1 | 11.1% |
| l | 1 | 11.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 9 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 3 | |
| C | 1 | 11.1% |
| m | 1 | 11.1% |
| p | 1 | 11.1% |
| n | 1 | 11.1% |
| u | 1 | 11.1% |
| l | 1 | 11.1% |
originalNameUsageID
Text
Constant  Missing 
| Distinct | 1 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 2361472 |
| Missing (%) | > 99.9% |
| Memory size | 18.0 MiB |
Length
| Max length | 68 |
|---|---|
| Median length | 68 |
| Mean length | 68 |
| Min length | 68 |
Unique
| Unique | 1 |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | Plantae, Dicotyledonae (basal), Magnoliales, Annonaceae, Annonoideae |
|---|
| Value | Count | Frequency (%) |
| plantae | 1 | |
| dicotyledonae | 1 | |
| basal | 1 | |
| magnoliales | 1 | |
| annonaceae | 1 | |
| annonoideae | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 10 | |
| n | 9 | |
| e | 8 | |
| o | 6 | |
| (space) | 5 | 7.4% |
| l | 5 | 7.4% |
| , | 4 | 5.9% |
| i | 3 | 4.4% |
| c | 2 | 2.9% |
| s | 2 | 2.9% |
| Other values (11) | 14 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 52 | |
| Space Separator | 5 | 7.4% |
| Uppercase Letter | 5 | 7.4% |
| Other Punctuation | 4 | 5.9% |
| Open Punctuation | 1 | 1.5% |
| Close Punctuation | 1 | 1.5% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 10 | |
| n | 9 | |
| e | 8 | |
| o | 6 | |
| l | 5 | |
| i | 3 | 5.8% |
| c | 2 | 3.8% |
| s | 2 | 3.8% |
| d | 2 | 3.8% |
| t | 2 | 3.8% |
| Other values (3) | 3 | 5.8% |
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 2 | |
| D | 1 | |
| M | 1 | |
| P | 1 |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 5 |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 4 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 1 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 57 | |
| Common | 11 | 16.2% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 10 | |
| n | 9 | |
| e | 8 | |
| o | 6 | |
| l | 5 | |
| i | 3 | 5.3% |
| c | 2 | 3.5% |
| s | 2 | 3.5% |
| d | 2 | 3.5% |
| A | 2 | 3.5% |
| Other values (7) | 8 |
Common
| Value | Count | Frequency (%) |
| (space) | 5 | |
| , | 4 | |
| ( | 1 | 9.1% |
| ) | 1 | 9.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 68 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 10 | |
| n | 9 | |
| e | 8 | |
| o | 6 | |
| (space) | 5 | 7.4% |
| l | 5 | 7.4% |
| , | 4 | 5.9% |
| i | 3 | 4.4% |
| c | 2 | 2.9% |
| s | 2 | 2.9% |
| Other values (11) | 14 |
nameAccordingToID
Text
Constant  Missing 
| Distinct | 1 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 2361472 |
| Missing (%) | > 99.9% |
| Memory size | 18.0 MiB |
Length
| Max length | 7 |
|---|---|
| Median length | 7 |
| Mean length | 7 |
| Min length | 7 |
Unique
| Unique | 1 |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | Plantae |
|---|
| Value | Count | Frequency (%) |
| plantae | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 2 | |
| P | 1 | |
| l | 1 | |
| n | 1 | |
| t | 1 | |
| e | 1 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 6 | |
| Uppercase Letter | 1 | 14.3% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 2 | |
| l | 1 | |
| n | 1 | |
| t | 1 | |
| e | 1 |
Uppercase Letter
| Value | Count | Frequency (%) |
| P | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 7 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 2 | |
| P | 1 | |
| l | 1 | |
| n | 1 | |
| t | 1 | |
| e | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 7 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 2 | |
| P | 1 | |
| l | 1 | |
| n | 1 | |
| t | 1 | |
| e | 1 |
Missing 
| Distinct | 4 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 2361469 |
| Missing (%) | > 99.9% |
| Memory size | 18.0 MiB |
Length
| Max length | 100 |
|---|---|
| Median length | 74 |
| Mean length | 43 |
| Min length | 12 |
Unique
| Unique | 4 |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | OCCURRENCE_STATUS_INFERRED_FROM_INDIVIDUAL_COUNT;GEODETIC_DATUM_ASSUMED_WGS84;GEODETIC_DATUM_INVALID |
|---|---|
| 2nd row | rotundifolia |
| 3rd row | Tracheophyta |
| 4th row | OCCURRENCE_STATUS_INFERRED_FROM_INDIVIDUAL_COUNT |
| Value | Count | Frequency (%) |
| occurrence_status_inferred_from_individual_count;geodetic_datum_assumed_wgs84;geodetic_datum_invalid | 1 | |
| rotundifolia | 1 | |
| tracheophyta | 1 | |
| occurrence_status_inferred_from_individual_count | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| _ | 15 | 8.7% |
| E | 13 | 7.6% |
| D | 12 | 7.0% |
| I | 12 | 7.0% |
| T | 11 | 6.4% |
| U | 11 | 6.4% |
| C | 10 | 5.8% |
| R | 10 | 5.8% |
| N | 9 | 5.2% |
| A | 8 | 4.7% |
| Other values (26) | 61 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 130 | |
| Lowercase Letter | 23 | 13.4% |
| Connector Punctuation | 15 | 8.7% |
| Other Punctuation | 2 | 1.2% |
| Decimal Number | 2 | 1.2% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| E | 13 | |
| D | 12 | |
| I | 12 | |
| T | 11 | |
| U | 11 | |
| C | 10 | 7.7% |
| R | 10 | 7.7% |
| N | 9 | 6.9% |
| A | 8 | 6.2% |
| O | 8 | 6.2% |
| Other values (7) | 26 |
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 3 | |
| o | 3 | |
| r | 2 | 8.7% |
| t | 2 | 8.7% |
| h | 2 | 8.7% |
| i | 2 | 8.7% |
| p | 1 | 4.3% |
| e | 1 | 4.3% |
| c | 1 | 4.3% |
| l | 1 | 4.3% |
| Other values (5) | 5 |
Decimal Number
| Value | Count | Frequency (%) |
| 4 | 1 | |
| 8 | 1 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 15 |
Other Punctuation
| Value | Count | Frequency (%) |
| ; | 2 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 153 | |
| Common | 19 | 11.0% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| E | 13 | 8.5% |
| D | 12 | 7.8% |
| I | 12 | 7.8% |
| T | 11 | 7.2% |
| U | 11 | 7.2% |
| C | 10 | 6.5% |
| R | 10 | 6.5% |
| N | 9 | 5.9% |
| A | 8 | 5.2% |
| O | 8 | 5.2% |
| Other values (22) | 49 |
Common
| Value | Count | Frequency (%) |
| _ | 15 | |
| ; | 2 | 10.5% |
| 4 | 1 | 5.3% |
| 8 | 1 | 5.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 172 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| _ | 15 | 8.7% |
| E | 13 | 7.6% |
| D | 12 | 7.0% |
| I | 12 | 7.0% |
| T | 11 | 6.4% |
| U | 11 | 6.4% |
| C | 10 | 5.8% |
| R | 10 | 5.8% |
| N | 9 | 5.2% |
| A | 8 | 4.7% |
| Other values (26) | 61 |
taxonConceptID
Text
Missing 
| Distinct | 2 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 2361471 |
| Missing (%) | > 99.9% |
| Memory size | 18.0 MiB |
Length
| Max length | 13 |
|---|---|
| Median length | 11.5 |
| Mean length | 11.5 |
| Min length | 10 |
Unique
| Unique | 2 |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | Magnoliopsida |
|---|---|
| 2nd row | StillImage |
| Value | Count | Frequency (%) |
| magnoliopsida | 1 | |
| stillimage | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 3 | |
| l | 3 | |
| i | 3 | |
| g | 2 | 8.7% |
| o | 2 | 8.7% |
| M | 1 | 4.3% |
| n | 1 | 4.3% |
| p | 1 | 4.3% |
| s | 1 | 4.3% |
| d | 1 | 4.3% |
| Other values (5) | 5 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 20 | |
| Uppercase Letter | 3 | 13.0% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 3 | |
| l | 3 | |
| i | 3 | |
| g | 2 | |
| o | 2 | |
| n | 1 | 5.0% |
| p | 1 | 5.0% |
| s | 1 | 5.0% |
| d | 1 | 5.0% |
| t | 1 | 5.0% |
| Other values (2) | 2 |
Uppercase Letter
| Value | Count | Frequency (%) |
| M | 1 | |
| S | 1 | |
| I | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 23 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 3 | |
| l | 3 | |
| i | 3 | |
| g | 2 | 8.7% |
| o | 2 | 8.7% |
| M | 1 | 4.3% |
| n | 1 | 4.3% |
| p | 1 | 4.3% |
| s | 1 | 4.3% |
| d | 1 | 4.3% |
| Other values (5) | 5 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 23 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 3 | |
| l | 3 | |
| i | 3 | |
| g | 2 | 8.7% |
| o | 2 | 8.7% |
| M | 1 | 4.3% |
| n | 1 | 4.3% |
| p | 1 | 4.3% |
| s | 1 | 4.3% |
| d | 1 | 4.3% |
| Other values (5) | 5 |
scientificName
Text
| Distinct | 362008 |
|---|---|
| Distinct (%) | 15.3% |
| Missing | 10 |
| Missing (%) | < 0.1% |
| Memory size | 18.0 MiB |
Length
| Max length | 234 |
|---|---|
| Median length | 109 |
| Mean length | 31.78332034 |
| Min length | 4 |
Unique
| Unique | 179316 |
|---|---|
| Unique (%) | 7.6% |
Sample
| 1st row | Hippolytidae |
|---|---|
| 2nd row | Lesquerella lescurii (A.Gray) S.Watson |
| 3rd row | Desmognathus ochrophaeus Cope, 1859 |
| 4th row | Scleractinia |
| 5th row | Ninoe kinbergi Ehlers, 1887 |
| Value | Count | Frequency (%) |
|  | 238208 | 2.6% |
| l | 180173 | 2.0% |
| ex | 82914 | 0.9% |
| linnaeus | 79974 | 0.9% |
| 1758 | 62066 | 0.7% |
| var | 50926 | 0.6% |
| plethodon | 42963 | 0.5% |
| 1818 | 33176 | 0.4% |
| kunth | 29864 | 0.3% |
| dc | 29734 | 0.3% |
| Other values (185831) | 8293415 |
Most occurring characters
| Value | Count | Frequency (%) |
| (space) | 6761950 | 9.0% |
| a | 6242590 | 8.3% |
| i | 5013451 | 6.7% |
| e | 4703897 | 6.3% |
| r | 3970577 | 5.3% |
| s | 3847327 | 5.1% |
| o | 3620560 | 4.8% |
| n | 3486756 | 4.6% |
| l | 3394828 | 4.5% |
| u | 2955157 | 3.9% |
| Other values (118) | 31058042 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 52793618 | |
| Space Separator | 6761950 | 9.0% |
| Uppercase Letter | 6033053 | 8.0% |
| Decimal Number | 4371020 | 5.8% |
| Other Punctuation | 3122342 | 4.2% |
| Close Punctuation | 972201 | 1.3% |
| Open Punctuation | 972201 | 1.3% |
| Dash Punctuation | 25650 | < 0.1% |
| Math Symbol | 3079 | < 0.1% |
| Connector Punctuation | 21 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 6242590 | |
| i | 5013451 | 9.5% |
| e | 4703897 | 8.9% |
| r | 3970577 | 7.5% |
| s | 3847327 | 7.3% |
| o | 3620560 | 6.9% |
| n | 3486756 | 6.6% |
| l | 3394828 | 6.4% |
| u | 2955157 | 5.6% |
| t | 2758965 | 5.2% |
| Other values (56) | 12799510 |
Uppercase Letter
| Value | Count | Frequency (%) |
| L | 571632 | 9.5% |
| S | 558239 | 9.3% |
| C | 547216 | 9.1% |
| P | 505361 | 8.4% |
| A | 411245 | 6.8% |
| B | 404229 | 6.7% |
| M | 403339 | 6.7% |
| H | 337256 | 5.6% |
| G | 316928 | 5.3% |
| D | 272745 | 4.5% |
| Other values (31) | 1704863 |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 1310245 | |
| 8 | 913594 | |
| 9 | 482023 | 11.0% |
| 7 | 349542 | 8.0% |
| 5 | 252107 | 5.8% |
| 0 | 229816 | 5.3% |
| 2 | 222891 | 5.1% |
| 6 | 220119 | 5.0% |
| 3 | 202794 | 4.6% |
| 4 | 187889 | 4.3% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 1769758 | |
| , | 1109322 | |
| & | 238207 | 7.6% |
| ' | 5054 | 0.2% |
| ? | 1 | < 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 6761950 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 972201 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 972201 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 25650 |
Math Symbol
| Value | Count | Frequency (%) |
| × | 3079 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 21 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 58826671 | |
| Common | 16228464 | 21.6% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 6242590 | 10.6% |
| i | 5013451 | 8.5% |
| e | 4703897 | 8.0% |
| r | 3970577 | 6.7% |
| s | 3847327 | 6.5% |
| o | 3620560 | 6.2% |
| n | 3486756 | 5.9% |
| l | 3394828 | 5.8% |
| u | 2955157 | 5.0% |
| t | 2758965 | 4.7% |
| Other values (97) | 18832563 |
Common
| Value | Count | Frequency (%) |
| (space) | 6761950 | |
| . | 1769758 | 10.9% |
| 1 | 1310245 | 8.1% |
| , | 1109322 | 6.8% |
| ) | 972201 | 6.0% |
| ( | 972201 | 6.0% |
| 8 | 913594 | 5.6% |
| 9 | 482023 | 3.0% |
| 7 | 349542 | 2.2% |
| 5 | 252107 | 1.6% |
| Other values (11) | 1335521 | 8.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 74931506 | |
| None | 123629 | 0.2% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| (space) | 6761950 | 9.0% |
| a | 6242590 | 8.3% |
| i | 5013451 | 6.7% |
| e | 4703897 | 6.3% |
| r | 3970577 | 5.3% |
| s | 3847327 | 5.1% |
| o | 3620560 | 4.8% |
| n | 3486756 | 4.7% |
| l | 3394828 | 4.5% |
| u | 2955157 | 3.9% |
| Other values (62) | 30934413 |
None
| Value | Count | Frequency (%) |
| ü | 38809 | |
| é | 27121 | |
| ö | 16572 | |
| è | 11041 | 8.9% |
| å | 4789 | 3.9% |
| ø | 3665 | 3.0% |
| ä | 3627 | 2.9% |
| á | 3625 | 2.9% |
| × | 3079 | 2.5% |
| Á | 2054 | 1.7% |
| Other values (46) | 9247 | 7.5% |
Missing 
| Distinct | 2 |
|---|---|
| Distinct (%) | 66.7% |
| Missing | 2361470 |
| Missing (%) | > 99.9% |
| Memory size | 18.0 MiB |
Length
| Max length | 7 |
|---|---|
| Median length | 5 |
| Mean length | 5.666666667 |
| Min length | 5 |
Unique
| Unique | 1 |
|---|---|
| Unique (%) | 33.3% |
Sample
| 1st row | false |
|---|---|
| 2nd row | SPECIES |
| 3rd row | false |
| Value | Count | Frequency (%) |
| false | 2 | |
| species | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| f | 2 | |
| a | 2 | |
| l | 2 | |
| s | 2 | |
| e | 2 | |
| S | 2 | |
| E | 2 | |
| P | 1 | |
| C | 1 | |
| I | 1 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 10 | |
| Uppercase Letter | 7 |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| f | 2 | |
| a | 2 | |
| l | 2 | |
| s | 2 | |
| e | 2 |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 2 | |
| E | 2 | |
| P | 1 | |
| C | 1 | |
| I | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 17 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| f | 2 | |
| a | 2 | |
| l | 2 | |
| s | 2 | |
| e | 2 | |
| S | 2 | |
| E | 2 | |
| P | 1 | |
| C | 1 | |
| I | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 17 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| f | 2 | |
| a | 2 | |
| l | 2 | |
| s | 2 | |
| e | 2 | |
| S | 2 | |
| E | 2 | |
| P | 1 | |
| C | 1 | |
| I | 1 |
parentNameUsage
Text
Missing 
| Distinct | 4 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 2361469 |
| Missing (%) | > 99.9% |
| Memory size | 18.0 MiB |
Length
| Max length | 10 |
|---|---|
| Median length | 9.5 |
| Mean length | 8.25 |
| Min length | 7 |
Unique
| Unique | 4 |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | 3190721 |
|---|---|
| 2nd row | GEOLocate |
| 3rd row | Annonaceae |
| 4th row | 3869031 |
| Value | Count | Frequency (%) |
| 3190721 | 1 | |
| geolocate | 1 | |
| annonaceae | 1 | |
| 3869031 | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| 3 | 3 | 9.1% |
| a | 3 | 9.1% |
| n | 3 | 9.1% |
| e | 3 | 9.1% |
| 1 | 3 | 9.1% |
| 9 | 2 | 6.1% |
| 0 | 2 | 6.1% |
| o | 2 | 6.1% |
| c | 2 | 6.1% |
| 8 | 1 | 3.0% |
| Other values (9) | 9 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 14 | |
| Lowercase Letter | 14 | |
| Uppercase Letter | 5 | 15.2% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 3 | 3 | |
| 1 | 3 | |
| 9 | 2 | |
| 0 | 2 | |
| 8 | 1 | 7.1% |
| 2 | 1 | 7.1% |
| 7 | 1 | 7.1% |
| 6 | 1 | 7.1% |
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 3 | |
| n | 3 | |
| e | 3 | |
| o | 2 | |
| c | 2 | |
| t | 1 | 7.1% |
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 1 | |
| L | 1 | |
| O | 1 | |
| E | 1 | |
| G | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 19 | |
| Common | 14 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 3 | |
| n | 3 | |
| e | 3 | |
| o | 2 | |
| c | 2 | |
| A | 1 | 5.3% |
| t | 1 | 5.3% |
| L | 1 | 5.3% |
| O | 1 | 5.3% |
| E | 1 | 5.3% |
Common
| Value | Count | Frequency (%) |
| 3 | 3 | |
| 1 | 3 | |
| 9 | 2 | |
| 0 | 2 | |
| 8 | 1 | 7.1% |
| 2 | 1 | 7.1% |
| 7 | 1 | 7.1% |
| 6 | 1 | 7.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 33 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 3 | 3 | 9.1% |
| a | 3 | 9.1% |
| n | 3 | 9.1% |
| e | 3 | 9.1% |
| 1 | 3 | 9.1% |
| 9 | 2 | 6.1% |
| 0 | 2 | 6.1% |
| o | 2 | 6.1% |
| c | 2 | 6.1% |
| 8 | 1 | 3.0% |
| Other values (9) | 9 |
Missing 
| Distinct | 2 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 2361471 |
| Missing (%) | > 99.9% |
| Memory size | 18.0 MiB |
Length
| Max length | 7 |
|---|---|
| Median length | 7 |
| Mean length | 7 |
| Min length | 7 |
Unique
| Unique | 2 |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | 3190721 |
|---|---|
| 2nd row | 3869031 |
| Value | Count | Frequency (%) |
| 3190721 | 1 | |
| 3869031 | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| 3 | 3 | |
| 1 | 3 | |
| 9 | 2 | |
| 0 | 2 | |
| 7 | 1 | 7.1% |
| 2 | 1 | 7.1% |
| 8 | 1 | 7.1% |
| 6 | 1 | 7.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 14 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 3 | 3 | |
| 1 | 3 | |
| 9 | 2 | |
| 0 | 2 | |
| 7 | 1 | 7.1% |
| 2 | 1 | 7.1% |
| 8 | 1 | 7.1% |
| 6 | 1 | 7.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 14 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 3 | 3 | |
| 1 | 3 | |
| 9 | 2 | |
| 0 | 2 | |
| 7 | 1 | 7.1% |
| 2 | 1 | 7.1% |
| 8 | 1 | 7.1% |
| 6 | 1 | 7.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 14 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 3 | 3 | |
| 1 | 3 | |
| 9 | 2 | |
| 0 | 2 | |
| 7 | 1 | 7.1% |
| 2 | 1 | 7.1% |
| 8 | 1 | 7.1% |
| 6 | 1 | 7.1% |
nameAccordingTo
Text
Constant  Missing 
| Distinct | 1 |
|---|---|
| Distinct (%) | 50.0% |
| Missing | 2361471 |
| Missing (%) | > 99.9% |
| Memory size | 18.0 MiB |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 6 |
|---|---|
| 2nd row | 6 |
| Value | Count | Frequency (%) |
| 6 | 2 |
Most occurring characters
| Value | Count | Frequency (%) |
| 6 | 2 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 2 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 6 | 2 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 2 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 6 | 2 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 6 | 2 |
namePublishedIn
Text
Missing 
| Distinct | 2 |
|---|---|
| Distinct (%) | 66.7% |
| Missing | 2361470 |
| Missing (%) | > 99.9% |
| Memory size | 18.0 MiB |
Length
| Max length | 8 |
|---|---|
| Median length | 7 |
| Mean length | 7.333333333 |
| Min length | 7 |
Unique
| Unique | 1 |
|---|---|
| Unique (%) | 33.3% |
Sample
| 1st row | 7707728 |
|---|---|
| 2nd row | ACCEPTED |
| 3rd row | 7707728 |
| Value | Count | Frequency (%) |
| 7707728 | 2 | |
| accepted | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| 7 | 8 | |
| 0 | 2 | 9.1% |
| 2 | 2 | 9.1% |
| 8 | 2 | 9.1% |
| C | 2 | 9.1% |
| E | 2 | 9.1% |
| A | 1 | 4.5% |
| P | 1 | 4.5% |
| T | 1 | 4.5% |
| D | 1 | 4.5% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 14 | |
| Uppercase Letter | 8 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 2 | |
| E | 2 | |
| A | 1 | |
| P | 1 | |
| T | 1 | |
| D | 1 |
Decimal Number
| Value | Count | Frequency (%) |
| 7 | 8 | |
| 0 | 2 | 14.3% |
| 2 | 2 | 14.3% |
| 8 | 2 | 14.3% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 14 | |
| Latin | 8 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| C | 2 | |
| E | 2 | |
| A | 1 | |
| P | 1 | |
| T | 1 | |
| D | 1 |
Common
| Value | Count | Frequency (%) |
| 7 | 8 | |
| 0 | 2 | 14.3% |
| 2 | 2 | 14.3% |
| 8 | 2 | 14.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 22 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 7 | 8 | |
| 0 | 2 | 9.1% |
| 2 | 2 | 9.1% |
| 8 | 2 | 9.1% |
| C | 2 | 9.1% |
| E | 2 | 9.1% |
| A | 1 | 4.5% |
| P | 1 | 4.5% |
| T | 1 | 4.5% |
| D | 1 | 4.5% |
Missing 
| Distinct | 2 |
|---|---|
| Distinct (%) | 66.7% |
| Missing | 2361470 |
| Missing (%) | > 99.9% |
| Memory size | 18.0 MiB |
Length
| Max length | 9 |
|---|---|
| Median length | 3 |
| Mean length | 5 |
| Min length | 3 |
Unique
| Unique | 1 |
|---|---|
| Unique (%) | 33.3% |
Sample
| 1st row | 220 |
|---|---|
| 2nd row | Guatteria |
| 3rd row | 220 |
| Value | Count | Frequency (%) |
| 220 | 2 | |
| guatteria | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| 2 | 4 | |
| 0 | 2 | |
| a | 2 | |
| t | 2 | |
| G | 1 | 6.7% |
| u | 1 | 6.7% |
| e | 1 | 6.7% |
| r | 1 | 6.7% |
| i | 1 | 6.7% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 8 | |
| Decimal Number | 6 | |
| Uppercase Letter | 1 | 6.7% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 2 | |
| t | 2 | |
| u | 1 | |
| e | 1 | |
| r | 1 | |
| i | 1 |
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 4 | |
| 0 | 2 |
Uppercase Letter
| Value | Count | Frequency (%) |
| G | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 9 | |
| Common | 6 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 2 | |
| t | 2 | |
| G | 1 | |
| u | 1 | |
| e | 1 | |
| r | 1 | |
| i | 1 |
Common
| Value | Count | Frequency (%) |
| 2 | 4 | |
| 0 | 2 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 15 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 2 | 4 | |
| 0 | 2 | |
| a | 2 | |
| t | 2 | |
| G | 1 | 6.7% |
| u | 1 | 6.7% |
| e | 1 | 6.7% |
| r | 1 | 6.7% |
| i | 1 | 6.7% |
| Distinct | 9381 |
|---|---|
| Distinct (%) | 0.4% |
| Missing | 5000 |
| Missing (%) | 0.2% |
| Memory size | 18.0 MiB |
Length
| Max length | 164 |
|---|---|
| Median length | 148 |
| Mean length | 65.02881001 |
| Min length | 3 |
Unique
| Unique | 1505 |
|---|---|
| Unique (%) | 0.1% |
Sample
| 1st row | Animalia, Arthropoda, Crustacea, Malacostraca, Eumalacostraca, Eucarida, Decapoda, Pleocyemata, Hippolytidae |
|---|---|
| 2nd row | Plantae, Dicotyledonae, Brassicales, Brassicaceae, Brassicoideae |
| 3rd row | Animalia, Chordata, Vertebrata, Amphibia, Caudata, Plethodontidae |
| 4th row | Animalia, Cnidaria, Anthozoa, Hexacorallia, Scleractinia |
| 5th row | Animalia, Annelida, Polychaeta, Errantia, Eunicida, Lumbrineridae |
| Value | Count | Frequency (%) |
| animalia | 1209335 | 9.1% |
| plantae | 1054356 | 7.9% |
| dicotyledonae | 657170 | 4.9% |
| chordata | 572776 | 4.3% |
| vertebrata | 567549 | 4.3% |
| arthropoda | 251879 | 1.9% |
| monocotyledonae | 231105 | 1.7% |
| mollusca | 220773 | 1.7% |
| poales | 178488 | 1.3% |
| gastropoda | 155944 | 1.2% |
| Other values (9606) | 8232767 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 21342566 | |
| e | 15326706 | 10.0% |
| i | 11163562 | 7.3% |
| (space) | 10975669 | 7.2% |
| , | 10942165 | 7.1% |
| o | 9722855 | 6.3% |
| t | 8276865 | 5.4% |
| l | 7536114 | 4.9% |
| r | 7095165 | 4.6% |
| n | 6565956 | 4.3% |
| Other values (62) | 44291012 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 117973422 | |
| Uppercase Letter | 13298578 | 8.7% |
| Space Separator | 10975669 | 7.2% |
| Other Punctuation | 10947398 | 7.1% |
| Open Punctuation | 21699 | < 0.1% |
| Close Punctuation | 21699 | < 0.1% |
| Dash Punctuation | 127 | < 0.1% |
| Connector Punctuation | 31 | < 0.1% |
| Decimal Number | 11 | < 0.1% |
| Math Symbol | 1 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 21342566 | |
| e | 15326706 | |
| i | 11163562 | |
| o | 9722855 | |
| t | 8276865 | 7.0% |
| l | 7536114 | 6.4% |
| r | 7095165 | 6.0% |
| n | 6565956 | 5.6% |
| d | 5401327 | 4.6% |
| c | 5122421 | 4.3% |
| Other values (17) | 20419885 |
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 2756481 | |
| P | 2449943 | |
| C | 1607999 | |
| M | 1121374 | |
| D | 843351 | 6.3% |
| V | 632813 | 4.8% |
| E | 535529 | 4.0% |
| S | 528149 | 4.0% |
| L | 345229 | 2.6% |
| R | 342094 | 2.6% |
| Other values (16) | 2135616 |
Decimal Number
| Value | Count | Frequency (%) |
| 6 | 2 | |
| 9 | 2 | |
| 0 | 2 | |
| 2 | 2 | |
| 4 | 1 | |
| 1 | 1 | |
| 3 | 1 |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 10942165 | |
| . | 5211 | < 0.1% |
| ? | 16 | < 0.1% |
| / | 6 | < 0.1% |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 21661 | |
| [ | 38 | 0.2% |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 21661 | |
| ] | 38 | 0.2% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 10975669 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 127 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 31 |
Math Symbol
| Value | Count | Frequency (%) |
| + | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 131272000 | |
| Common | 21966635 | 14.3% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 21342566 | |
| e | 15326706 | |
| i | 11163562 | 8.5% |
| o | 9722855 | 7.4% |
| t | 8276865 | 6.3% |
| l | 7536114 | 5.7% |
| r | 7095165 | 5.4% |
| n | 6565956 | 5.0% |
| d | 5401327 | 4.1% |
| c | 5122421 | 3.9% |
| Other values (43) | 33718463 |
Common
| Value | Count | Frequency (%) |
| (space) | 10975669 | |
| , | 10942165 | |
| ( | 21661 | 0.1% |
| ) | 21661 | 0.1% |
| . | 5211 | < 0.1% |
| - | 127 | < 0.1% |
| [ | 38 | < 0.1% |
| ] | 38 | < 0.1% |
| _ | 31 | < 0.1% |
| ? | 16 | < 0.1% |
| Other values (9) | 18 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 153238470 | |
| None | 165 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 21342566 | |
| e | 15326706 | 10.0% |
| i | 11163562 | 7.3% |
| (space) | 10975669 | 7.2% |
| , | 10942165 | 7.1% |
| o | 9722855 | 6.3% |
| t | 8276865 | 5.4% |
| l | 7536114 | 4.9% |
| r | 7095165 | 4.6% |
| n | 6565956 | 4.3% |
| Other values (61) | 44290847 |
None
| Value | Count | Frequency (%) |
| ö | 165 |
kingdom
Text
| Distinct | 10 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 10 |
| Missing (%) | < 0.1% |
| Memory size | 18.0 MiB |
Length
| Max length | 36 |
|---|---|
| Median length | 8 |
| Mean length | 7.504671892 |
| Min length | 4 |
Unique
| Unique | 3 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Animalia |
|---|---|
| 2nd row | Plantae |
| 3rd row | Animalia |
| 4th row | Animalia |
| 5th row | Animalia |
| Value | Count | Frequency (%) |
| animalia | 1209386 | |
| plantae | 1054744 | |
| fungi | 56807 | 2.4% |
| chromista | 20874 | 0.9% |
| bacteria | 13612 | 0.6% |
| incertae | 5762 | 0.2% |
| sedis | 5762 | 0.2% |
| protozoa | 275 | < 0.1% |
| 5399 | 1 | < 0.1% |
| 821cc27a-e3bb-4bc5-ac34-89ada245069d | 1 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 4582399 | |
| i | 2521589 | |
| n | 2326699 | |
| l | 2264130 | |
| m | 1230260 | 6.9% |
| A | 1209386 | 6.8% |
| t | 1095267 | 6.2% |
| e | 1085643 | 6.1% |
| P | 1055019 | 6.0% |
| u | 56807 | 0.3% |
| Other values (24) | 294806 | 1.7% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 15360515 | |
| Uppercase Letter | 2355698 | 13.3% |
| Space Separator | 5762 | < 0.1% |
| Decimal Number | 26 | < 0.1% |
| Dash Punctuation | 4 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 4582399 | |
| i | 2521589 | |
| n | 2326699 | |
| l | 2264130 | |
| m | 1230260 | 8.0% |
| t | 1095267 | 7.1% |
| e | 1085643 | 7.1% |
| u | 56807 | 0.4% |
| g | 56807 | 0.4% |
| r | 40523 | 0.3% |
| Other values (7) | 100391 | 0.7% |
Decimal Number
| Value | Count | Frequency (%) |
| 3 | 4 | |
| 9 | 4 | |
| 5 | 3 | |
| 8 | 3 | |
| 2 | 3 | |
| 4 | 3 | |
| 6 | 3 | |
| 1 | 1 | 3.8% |
| 7 | 1 | 3.8% |
| 0 | 1 | 3.8% |
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 1209386 | |
| P | 1055019 | |
| F | 56807 | 2.4% |
| C | 20874 | 0.9% |
| B | 13612 | 0.6% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 5762 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 4 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 17716213 | |
| Common | 5792 | < 0.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 4582399 | |
| i | 2521589 | |
| n | 2326699 | |
| l | 2264130 | |
| m | 1230260 | 6.9% |
| A | 1209386 | 6.8% |
| t | 1095267 | 6.2% |
| e | 1085643 | 6.1% |
| P | 1055019 | 6.0% |
| u | 56807 | 0.3% |
| Other values (12) | 289014 | 1.6% |
Common
| Value | Count | Frequency (%) |
| (space) | 5762 | |
| 3 | 4 | 0.1% |
| 9 | 4 | 0.1% |
| - | 4 | 0.1% |
| 5 | 3 | 0.1% |
| 8 | 3 | 0.1% |
| 2 | 3 | 0.1% |
| 4 | 3 | 0.1% |
| 6 | 3 | 0.1% |
| 1 | 1 | < 0.1% |
| Other values (2) | 2 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 17722005 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 4582399 | |
| i | 2521589 | |
| n | 2326699 | |
| l | 2264130 | |
| m | 1230260 | 6.9% |
| A | 1209386 | 6.8% |
| t | 1095267 | 6.2% |
| e | 1085643 | 6.1% |
| P | 1055019 | 6.0% |
| u | 56807 | 0.3% |
| Other values (24) | 294806 | 1.7% |
phylum
Text
| Distinct | 64 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 7896 |
| Missing (%) | 0.3% |
| Memory size | 18.0 MiB |
Length
| Max length | 17 |
|---|---|
| Median length | 16 |
| Mean length | 10.11762054 |
| Min length | 2 |
Unique
| Unique | 7 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Arthropoda |
|---|---|
| 2nd row | Tracheophyta |
| 3rd row | Chordata |
| 4th row | Cnidaria |
| 5th row | Annelida |
| Value | Count | Frequency (%) |
| tracheophyta | 965311 | |
| chordata | 572771 | |
| arthropoda | 252406 | 10.7% |
| mollusca | 220179 | 9.4% |
| annelida | 61416 | 2.6% |
| ascomycota | 56083 | 2.4% |
| bryophyta | 37922 | 1.6% |
| rhodophyta | 30954 | 1.3% |
| cnidaria | 29998 | 1.3% |
| echinodermata | 23220 | 1.0% |
| Other values (54) | 103317 | 4.4% |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 4009610 | |
| h | 2984383 | |
| o | 2602587 | |
| r | 2209125 | |
| t | 2042901 | |
| c | 1367427 | 5.7% |
| p | 1328235 | 5.6% |
| y | 1197926 | 5.0% |
| e | 1120639 | 4.7% |
| d | 989852 | 4.2% |
| Other values (37) | 3959914 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 21459009 | |
| Uppercase Letter | 2353576 | 9.9% |
| Decimal Number | 14 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 4009610 | |
| h | 2984383 | |
| o | 2602587 | |
| r | 2209125 | |
| t | 2042901 | |
| c | 1367427 | 6.4% |
| p | 1328235 | 6.2% |
| y | 1197926 | 5.6% |
| e | 1120639 | 5.2% |
| d | 989852 | 4.6% |
| Other values (10) | 1606324 |
Uppercase Letter
| Value | Count | Frequency (%) |
| T | 965446 | |
| C | 629275 | |
| A | 371272 | 15.8% |
| M | 229756 | 9.8% |
| B | 40910 | 1.7% |
| R | 31169 | 1.3% |
| E | 23307 | 1.0% |
| P | 20411 | 0.9% |
| N | 19086 | 0.8% |
| O | 17979 | 0.8% |
| Other values (9) | 4965 | 0.2% |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 3 | |
| 8 | 3 | |
| 3 | 2 | |
| 5 | 2 | |
| 9 | 1 | 7.1% |
| 0 | 1 | 7.1% |
| 7 | 1 | 7.1% |
| 2 | 1 | 7.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 23812585 | |
| Common | 14 | < 0.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 4009610 | |
| h | 2984383 | |
| o | 2602587 | |
| r | 2209125 | |
| t | 2042901 | |
| c | 1367427 | 5.7% |
| p | 1328235 | 5.6% |
| y | 1197926 | 5.0% |
| e | 1120639 | 4.7% |
| d | 989852 | 4.2% |
| Other values (29) | 3959900 |
Common
| Value | Count | Frequency (%) |
| 1 | 3 | |
| 8 | 3 | |
| 3 | 2 | |
| 5 | 2 | |
| 9 | 1 | 7.1% |
| 0 | 1 | 7.1% |
| 7 | 1 | 7.1% |
| 2 | 1 | 7.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 23812599 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 4009610 | |
| h | 2984383 | |
| o | 2602587 | |
| r | 2209125 | |
| t | 2042901 | |
| c | 1367427 | 5.7% |
| p | 1328235 | 5.6% |
| y | 1197926 | 5.0% |
| e | 1120639 | 4.7% |
| d | 989852 | 4.2% |
| Other values (37) | 3959914 |
class
Text
Missing 
| Distinct | 186 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 138563 |
| Missing (%) | 5.9% |
| Memory size | 18.0 MiB |
Length
| Max length | 24 |
|---|---|
| Median length | 20 |
| Mean length | 10.43315114 |
| Min length | 4 |
Unique
| Unique | 20 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Malacostraca |
|---|---|
| 2nd row | Magnoliopsida |
| 3rd row | Amphibia |
| 4th row | Anthozoa |
| 5th row | Polychaeta |
| Value | Count | Frequency (%) |
| magnoliopsida | 657370 | |
| liliopsida | 231154 | 10.4% |
| gastropoda | 155259 | 7.0% |
| mammalia | 152953 | 6.9% |
| insecta | 149742 | 6.7% |
| aves | 149231 | 6.7% |
| amphibia | 100689 | 4.5% |
| malacostraca | 76525 | 3.4% |
| polypodiopsida | 63916 | 2.9% |
| polychaeta | 53619 | 2.4% |
| Other values (176) | 432452 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 3739588 | |
| i | 2845281 | |
| o | 2681344 | |
| s | 1638014 | 7.1% |
| p | 1464517 | 6.3% |
| l | 1407397 | 6.1% |
| d | 1364592 | 5.9% |
| n | 963712 | 4.2% |
| M | 891567 | 3.8% |
| e | 817658 | 3.5% |
| Other values (49) | 5378286 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 20969024 | |
| Uppercase Letter | 2222910 | 9.6% |
| Decimal Number | 17 | < 0.1% |
| Other Punctuation | 3 | < 0.1% |
| Dash Punctuation | 2 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 3739588 | |
| i | 2845281 | |
| o | 2681344 | |
| s | 1638014 | |
| p | 1464517 | 7.0% |
| l | 1407397 | 6.7% |
| d | 1364592 | 6.5% |
| n | 963712 | 4.6% |
| e | 817658 | 3.9% |
| g | 676609 | 3.2% |
| Other values (15) | 3370312 |
Uppercase Letter
| Value | Count | Frequency (%) |
| M | 891567 | |
| L | 289993 | 13.0% |
| A | 289952 | 13.0% |
| G | 157763 | 7.1% |
| I | 149750 | 6.7% |
| P | 138048 | 6.2% |
| B | 98443 | 4.4% |
| C | 59514 | 2.7% |
| S | 49464 | 2.2% |
| F | 30189 | 1.4% |
| Other values (12) | 68227 | 3.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 4 | |
| 1 | 4 | |
| 0 | 2 | |
| 9 | 2 | |
| 4 | 1 | 5.9% |
| 3 | 1 | 5.9% |
| 5 | 1 | 5.9% |
| 7 | 1 | 5.9% |
| 6 | 1 | 5.9% |
Other Punctuation
| Value | Count | Frequency (%) |
| : | 2 | |
| . | 1 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 2 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 23191934 | |
| Common | 22 | < 0.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 3739588 | |
| i | 2845281 | |
| o | 2681344 | |
| s | 1638014 | 7.1% |
| p | 1464517 | 6.3% |
| l | 1407397 | 6.1% |
| d | 1364592 | 5.9% |
| n | 963712 | 4.2% |
| M | 891567 | 3.8% |
| e | 817658 | 3.5% |
| Other values (37) | 5378264 |
Common
| Value | Count | Frequency (%) |
| 2 | 4 | |
| 1 | 4 | |
| 0 | 2 | |
| - | 2 | |
| : | 2 | |
| 9 | 2 | |
| 4 | 1 | 4.5% |
| 3 | 1 | 4.5% |
| 5 | 1 | 4.5% |
| 7 | 1 | 4.5% |
| Other values (2) | 2 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 23191956 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 3739588 | |
| i | 2845281 | |
| o | 2681344 | |
| s | 1638014 | 7.1% |
| p | 1464517 | 6.3% |
| l | 1407397 | 6.1% |
| d | 1364592 | 5.9% |
| n | 963712 | 4.2% |
| M | 891567 | 3.8% |
| e | 817658 | 3.5% |
| Other values (49) | 5378286 |
order
Text
Missing 
| Distinct | 926 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 145729 |
| Missing (%) | 6.2% |
| Memory size | 18.0 MiB |
Length
| Max length | 22 |
|---|---|
| Median length | 19 |
| Mean length | 9.927911798 |
| Min length | 5 |
Unique
| Unique | 75 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Decapoda |
|---|---|
| 2nd row | Brassicales |
| 3rd row | Caudata |
| 4th row | Scleractinia |
| 5th row | Eunicida |
| Value | Count | Frequency (%) |
| poales | 178531 | 8.1% |
| asterales | 96944 | 4.4% |
| passeriformes | 94751 | 4.3% |
| rodentia | 75757 | 3.4% |
| lamiales | 67866 | 3.1% |
| fabales | 64632 | 2.9% |
| caudata | 60565 | 2.7% |
| perciformes | 54527 | 2.5% |
| malpighiales | 53482 | 2.4% |
| decapoda | 49962 | 2.3% |
| Other values (916) | 1418727 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 3200606 | |
| e | 2456217 | |
| s | 1919697 | 8.7% |
| l | 1767329 | 8.0% |
| o | 1673369 | 7.6% |
| i | 1557251 | 7.1% |
| r | 1440556 | 6.5% |
| t | 919616 | 4.2% |
| p | 728104 | 3.3% |
| n | 715516 | 3.3% |
| Other values (46) | 5619450 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 19781961 | |
| Uppercase Letter | 2215743 | 10.1% |
| Decimal Number | 7 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 3200606 | |
| e | 2456217 | |
| s | 1919697 | |
| l | 1767329 | |
| o | 1673369 | |
| i | 1557251 | |
| r | 1440556 | |
| t | 919616 | 4.6% |
| p | 728104 | 3.7% |
| n | 715516 | 3.6% |
| Other values (16) | 3403700 |
Uppercase Letter
| Value | Count | Frequency (%) |
| P | 469423 | |
| C | 313394 | |
| A | 244366 | |
| L | 185525 | 8.4% |
| S | 156732 | 7.1% |
| M | 140519 | 6.3% |
| R | 133204 | 6.0% |
| D | 93481 | 4.2% |
| F | 78521 | 3.5% |
| H | 72334 | 3.3% |
| Other values (14) | 328244 |
Decimal Number
| Value | Count | Frequency (%) |
| 3 | 2 | |
| 8 | 1 | |
| 6 | 1 | |
| 9 | 1 | |
| 0 | 1 | |
| 1 | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 21997704 | |
| Common | 7 | < 0.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 3200606 | |
| e | 2456217 | |
| s | 1919697 | 8.7% |
| l | 1767329 | 8.0% |
| o | 1673369 | 7.6% |
| i | 1557251 | 7.1% |
| r | 1440556 | 6.5% |
| t | 919616 | 4.2% |
| p | 728104 | 3.3% |
| n | 715516 | 3.3% |
| Other values (40) | 5619443 |
Common
| Value | Count | Frequency (%) |
| 3 | 2 | |
| 8 | 1 | |
| 6 | 1 | |
| 9 | 1 | |
| 0 | 1 | |
| 1 | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 21997711 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 3200606 | |
| e | 2456217 | |
| s | 1919697 | 8.7% |
| l | 1767329 | 8.0% |
| o | 1673369 | 7.6% |
| i | 1557251 | 7.1% |
| r | 1440556 | 6.5% |
| t | 919616 | 4.2% |
| p | 728104 | 3.3% |
| n | 715516 | 3.3% |
| Other values (46) | 5619450 |
superfamily
Text
Missing 
| Distinct | 2 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 2361471 |
| Missing (%) | > 99.9% |
| Memory size | 18.0 MiB |
Length
| Max length | 16 |
|---|---|
| Median length | 11.5 |
| Mean length | 11.5 |
| Min length | 7 |
Unique
| Unique | 2 |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | 3034046 |
|---|---|
| 2nd row | Miconia coronata |
| Value | Count | Frequency (%) |
| 3034046 | 1 | |
| miconia | 1 | |
| coronata | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| o | 3 | |
| a | 3 | |
| 3 | 2 | |
| 0 | 2 | |
| 4 | 2 | |
| i | 2 | |
| c | 2 | |
| n | 2 | |
| 6 | 1 | 4.3% |
| M | 1 | 4.3% |
| Other values (3) | 3 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 14 | |
| Decimal Number | 7 | |
| Uppercase Letter | 1 | 4.3% |
| Space Separator | 1 | 4.3% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| o | 3 | |
| a | 3 | |
| i | 2 | |
| c | 2 | |
| n | 2 | |
| r | 1 | 7.1% |
| t | 1 | 7.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 3 | 2 | |
| 0 | 2 | |
| 4 | 2 | |
| 6 | 1 |
Uppercase Letter
| Value | Count | Frequency (%) |
| M | 1 |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 15 | |
| Common | 8 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| o | 3 | |
| a | 3 | |
| i | 2 | |
| c | 2 | |
| n | 2 | |
| M | 1 | 6.7% |
| r | 1 | 6.7% |
| t | 1 | 6.7% |
Common
| Value | Count | Frequency (%) |
| 3 | 2 | |
| 0 | 2 | |
| 4 | 2 | |
| 6 | 1 | |
| (space) | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 23 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| o | 3 | |
| a | 3 | |
| 3 | 2 | |
| 0 | 2 | |
| 4 | 2 | |
| i | 2 | |
| c | 2 | |
| n | 2 | |
| 6 | 1 | 4.3% |
| M | 1 | 4.3% |
| Other values (3) | 3 |
family
Text
Missing 
| Distinct | 6622 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 52497 |
| Missing (%) | 2.2% |
| Memory size | 18.0 MiB |
Length
| Max length | 29 |
|---|---|
| Median length | 21 |
| Mean length | 10.85806089 |
| Min length | 6 |
Unique
| Unique | 722 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Hippolytidae |
|---|---|
| 2nd row | Brassicaceae |
| 3rd row | Plethodontidae |
| 4th row | Lumbrineridae |
| 5th row | Gomphidae |
| Value | Count | Frequency (%) |
| poaceae | 128004 | 5.5% |
| asteraceae | 91253 | 4.0% |
| fabaceae | 60425 | 2.6% |
| plethodontidae | 56509 | 2.4% |
| cyperaceae | 35190 | 1.5% |
| rubiaceae | 30478 | 1.3% |
| cricetidae | 27411 | 1.2% |
| muridae | 23714 | 1.0% |
| apidae | 20894 | 0.9% |
| melastomataceae | 18664 | 0.8% |
| Other values (6616) | 1816438 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 4499005 | |
| e | 4442765 | |
| i | 2325126 | |
| c | 1720240 | 6.9% |
| d | 1491888 | 6.0% |
| o | 1265964 | 5.0% |
| r | 1175408 | 4.7% |
| l | 1016477 | 4.1% |
| t | 859959 | 3.4% |
| n | 822238 | 3.3% |
| Other values (47) | 5451932 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 22761967 | |
| Uppercase Letter | 2309006 | 9.2% |
| Connector Punctuation | 20 | < 0.1% |
| Space Separator | 4 | < 0.1% |
| Other Punctuation | 3 | < 0.1% |
| Open Punctuation | 1 | < 0.1% |
| Close Punctuation | 1 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 4499005 | |
| e | 4442765 | |
| i | 2325126 | |
| c | 1720240 | 7.6% |
| d | 1491888 | 6.6% |
| o | 1265964 | 5.6% |
| r | 1175408 | 5.2% |
| l | 1016477 | 4.5% |
| t | 859959 | 3.8% |
| n | 822238 | 3.6% |
| Other values (16) | 3142897 |
Uppercase Letter
| Value | Count | Frequency (%) |
| P | 464221 | |
| C | 329853 | |
| A | 269139 | |
| S | 164131 | 7.1% |
| M | 163856 | 7.1% |
| L | 109923 | 4.8% |
| R | 92205 | 4.0% |
| F | 87633 | 3.8% |
| T | 85890 | 3.7% |
| B | 75520 | 3.3% |
| Other values (16) | 466635 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 20 |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 4 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 3 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 1 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 25070973 | |
| Common | 29 | < 0.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 4499005 | |
| e | 4442765 | |
| i | 2325126 | |
| c | 1720240 | 6.9% |
| d | 1491888 | 6.0% |
| o | 1265964 | 5.0% |
| r | 1175408 | 4.7% |
| l | 1016477 | 4.1% |
| t | 859959 | 3.4% |
| n | 822238 | 3.3% |
| Other values (42) | 5451903 |
Common
| Value | Count | Frequency (%) |
| _ | 20 | |
| (space) | 4 | 13.8% |
| . | 3 | 10.3% |
| ( | 1 | 3.4% |
| ) | 1 | 3.4% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 25071002 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 4499005 | |
| e | 4442765 | |
| i | 2325126 | |
| c | 1720240 | 6.9% |
| d | 1491888 | 6.0% |
| o | 1265964 | 5.0% |
| r | 1175408 | 4.7% |
| l | 1016477 | 4.1% |
| t | 859959 | 3.4% |
| n | 822238 | 3.3% |
| Other values (47) | 5451932 |
subfamily
Text
Missing 
| Distinct | 2 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 2361471 |
| Missing (%) | > 99.9% |
| Memory size | 18.0 MiB |
Length
| Max length | 16 |
|---|---|
| Median length | 13.5 |
| Mean length | 13.5 |
| Min length | 11 |
Unique
| Unique | 2 |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | Drosera sp. |
|---|---|
| 2nd row | Miconia coronata |
| Value | Count | Frequency (%) |
| drosera | 1 | |
| sp | 1 | |
| miconia | 1 | |
| coronata | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| o | 4 | |
| a | 4 | |
| r | 3 | |
| s | 2 | |
| (space) | 2 | |
| i | 2 | |
| c | 2 | |
| n | 2 | |
| D | 1 | 3.7% |
| e | 1 | 3.7% |
| Other values (4) | 4 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 22 | |
| Space Separator | 2 | 7.4% |
| Uppercase Letter | 2 | 7.4% |
| Other Punctuation | 1 | 3.7% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| o | 4 | |
| a | 4 | |
| r | 3 | |
| s | 2 | |
| i | 2 | |
| c | 2 | |
| n | 2 | |
| e | 1 | 4.5% |
| p | 1 | 4.5% |
| t | 1 | 4.5% |
Uppercase Letter
| Value | Count | Frequency (%) |
| D | 1 | |
| M | 1 |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 2 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 24 | |
| Common | 3 | 11.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| o | 4 | |
| a | 4 | |
| r | 3 | |
| s | 2 | |
| i | 2 | |
| c | 2 | |
| n | 2 | |
| D | 1 | 4.2% |
| e | 1 | 4.2% |
| p | 1 | 4.2% |
| Other values (2) | 2 |
Common
| Value | Count | Frequency (%) |
| (space) | 2 | |
| . | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 27 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| o | 4 | |
| a | 4 | |
| r | 3 | |
| s | 2 | |
| (space) | 2 | |
| i | 2 | |
| c | 2 | |
| n | 2 | |
| D | 1 | 3.7% |
| e | 1 | 3.7% |
| Other values (4) | 4 |
subtribe
Text
Missing 
| Distinct | 2 |
|---|---|
| Distinct (%) | 66.7% |
| Missing | 2361470 |
| Missing (%) | > 99.9% |
| Memory size | 18.0 MiB |
Length
| Max length | 77 |
|---|---|
| Median length | 3 |
| Mean length | 27.66666667 |
| Min length | 3 |
Unique
| Unique | 1 |
|---|---|
| Unique (%) | 33.3% |
Sample
| 1st row | EML |
|---|---|
| 2nd row | OCCURRENCE_STATUS_INFERRED_FROM_INDIVIDUAL_COUNT;GEODETIC_DATUM_ASSUMED_WGS84 |
| 3rd row | EML |
| Value | Count | Frequency (%) |
| eml | 2 | |
| occurrence_status_inferred_from_individual_count;geodetic_datum_assumed_wgs84 | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| E | 9 | 10.8% |
| _ | 8 | 9.6% |
| D | 6 | 7.2% |
| U | 6 | 7.2% |
| I | 5 | 6.0% |
| M | 5 | 6.0% |
| S | 5 | 6.0% |
| T | 5 | 6.0% |
| R | 5 | 6.0% |
| C | 5 | 6.0% |
| Other values (11) | 24 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 72 | |
| Connector Punctuation | 8 | 9.6% |
| Decimal Number | 2 | 2.4% |
| Other Punctuation | 1 | 1.2% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| E | 9 | |
| D | 6 | 8.3% |
| U | 6 | 8.3% |
| I | 5 | 6.9% |
| M | 5 | 6.9% |
| S | 5 | 6.9% |
| T | 5 | 6.9% |
| R | 5 | 6.9% |
| C | 5 | 6.9% |
| N | 4 | 5.6% |
| Other values (7) | 17 |
Decimal Number
| Value | Count | Frequency (%) |
| 8 | 1 | |
| 4 | 1 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 8 |
Other Punctuation
| Value | Count | Frequency (%) |
| ; | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 72 | |
| Common | 11 | 13.3% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| E | 9 | |
| D | 6 | 8.3% |
| U | 6 | 8.3% |
| I | 5 | 6.9% |
| M | 5 | 6.9% |
| S | 5 | 6.9% |
| T | 5 | 6.9% |
| R | 5 | 6.9% |
| C | 5 | 6.9% |
| N | 4 | 5.6% |
| Other values (7) | 17 |
Common
| Value | Count | Frequency (%) |
| _ | 8 | |
| ; | 1 | 9.1% |
| 8 | 1 | 9.1% |
| 4 | 1 | 9.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 83 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| E | 9 | 10.8% |
| _ | 8 | 9.6% |
| D | 6 | 7.2% |
| U | 6 | 7.2% |
| I | 5 | 6.0% |
| M | 5 | 6.0% |
| S | 5 | 6.0% |
| T | 5 | 6.0% |
| R | 5 | 6.0% |
| C | 5 | 6.0% |
| Other values (11) | 24 |
genus
Text
Missing 
| Distinct | 58510 |
|---|---|
| Distinct (%) | 2.6% |
| Missing | 120652 |
| Missing (%) | 5.1% |
| Memory size | 18.0 MiB |
Length
| Max length | 24 |
|---|---|
| Median length | 21 |
| Mean length | 9.034970665 |
| Min length | 2 |
Unique
| Unique | 16466 |
|---|---|
| Unique (%) | 0.7% |
Sample
| 1st row | Paysonia |
|---|---|
| 2nd row | Desmognathus |
| 3rd row | Ninoe |
| 4th row | Hylogomphus |
| 5th row | Skrjabinoclava |
| Value | Count | Frequency (%) |
| plethodon | 42953 | 1.9% |
| bombus | 15824 | 0.7% |
| carex | 14686 | 0.7% |
| miconia | 10093 | 0.5% |
| peromyscus | 10025 | 0.4% |
| desmognathus | 9258 | 0.4% |
| cladonia | 7917 | 0.4% |
| poa | 7658 | 0.3% |
| cyperus | 7007 | 0.3% |
| paspalum | 6575 | 0.3% |
| Other values (58499) | 2108825 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 2232664 | 11.0% |
| i | 1675014 | 8.3% |
| o | 1648843 | 8.1% |
| e | 1402186 | 6.9% |
| s | 1326799 | 6.6% |
| r | 1282975 | 6.3% |
| l | 1123780 | 5.6% |
| u | 1022138 | 5.0% |
| n | 994213 | 4.9% |
| t | 952492 | 4.7% |
| Other values (54) | 6584648 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 18004558 | |
| Uppercase Letter | 2240850 | 11.1% |
| Dash Punctuation | 304 | < 0.1% |
| Decimal Number | 34 | < 0.1% |
| Other Punctuation | 6 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 2232664 | |
| i | 1675014 | 9.3% |
| o | 1648843 | 9.2% |
| e | 1402186 | 7.8% |
| s | 1326799 | 7.4% |
| r | 1282975 | 7.1% |
| l | 1123780 | 6.2% |
| u | 1022138 | 5.7% |
| n | 994213 | 5.5% |
| t | 952492 | 5.3% |
| Other values (16) | 4343454 |
Uppercase Letter
| Value | Count | Frequency (%) |
| P | 337157 | |
| C | 290480 | |
| S | 207209 | |
| A | 204837 | |
| M | 154295 | 6.9% |
| E | 121845 | 5.4% |
| L | 119146 | 5.3% |
| T | 103927 | 4.6% |
| D | 102293 | 4.6% |
| B | 93056 | 4.2% |
| Other values (16) | 506605 |
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 9 | |
| 0 | 7 | |
| 5 | 5 | |
| 1 | 4 | |
| 4 | 3 | 8.8% |
| 3 | 2 | 5.9% |
| 7 | 2 | 5.9% |
| 8 | 1 | 2.9% |
| 9 | 1 | 2.9% |
Other Punctuation
| Value | Count | Frequency (%) |
| : | 4 | |
| . | 2 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 304 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 20245408 | |
| Common | 344 | < 0.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 2232664 | 11.0% |
| i | 1675014 | 8.3% |
| o | 1648843 | 8.1% |
| e | 1402186 | 6.9% |
| s | 1326799 | 6.6% |
| r | 1282975 | 6.3% |
| l | 1123780 | 5.6% |
| u | 1022138 | 5.0% |
| n | 994213 | 4.9% |
| t | 952492 | 4.7% |
| Other values (42) | 6584304 |
Common
| Value | Count | Frequency (%) |
| - | 304 | |
| 2 | 9 | 2.6% |
| 0 | 7 | 2.0% |
| 5 | 5 | 1.5% |
| 1 | 4 | 1.2% |
| : | 4 | 1.2% |
| 4 | 3 | 0.9% |
| 3 | 2 | 0.6% |
| 7 | 2 | 0.6% |
| . | 2 | 0.6% |
| Other values (2) | 2 | 0.6% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 20245752 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 2232664 | 11.0% |
| i | 1675014 | 8.3% |
| o | 1648843 | 8.1% |
| e | 1402186 | 6.9% |
| s | 1326799 | 6.6% |
| r | 1282975 | 6.3% |
| l | 1123780 | 5.6% |
| u | 1022138 | 5.0% |
| n | 994213 | 4.9% |
| t | 952492 | 4.7% |
| Other values (54) | 6584648 |
genericName
Text
Missing 
| Distinct | 60031 |
|---|---|
| Distinct (%) | 2.7% |
| Missing | 120743 |
| Missing (%) | 5.1% |
| Memory size | 18.0 MiB |
Length
| Max length | 24 |
|---|---|
| Median length | 21 |
| Mean length | 8.952593128 |
| Min length | 1 |
Unique
| Unique | 18598 |
|---|---|
| Unique (%) | 0.8% |
Sample
| 1st row | Lesquerella |
|---|---|
| 2nd row | Desmognathus |
| 3rd row | Ninoe |
| 4th row | Gomphus |
| 5th row | Skrjabinoclava |
| Value | Count | Frequency (%) |
| plethodon | 42953 | 1.9% |
| bombus | 15821 | 0.7% |
| carex | 14678 | 0.7% |
| peromyscus | 10025 | 0.4% |
| desmognathus | 9258 | 0.4% |
| poa | 7661 | 0.3% |
| cyperus | 6995 | 0.3% |
| cladonia | 6779 | 0.3% |
| paspalum | 6559 | 0.3% |
| solanum | 6347 | 0.3% |
| Other values (60020) | 2113656 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 2206605 | 11.0% |
| i | 1657025 | 8.3% |
| o | 1617618 | 8.1% |
| e | 1383725 | 6.9% |
| s | 1315600 | 6.6% |
| r | 1283428 | 6.4% |
| l | 1102712 | 5.5% |
| u | 1023714 | 5.1% |
| n | 983410 | 4.9% |
| t | 942000 | 4.7% |
| Other values (56) | 6544507 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 17819535 | |
| Uppercase Letter | 2240735 | 11.2% |
| Decimal Number | 34 | < 0.1% |
| Dash Punctuation | 30 | < 0.1% |
| Other Punctuation | 8 | < 0.1% |
| Space Separator | 2 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 2206605 | |
| i | 1657025 | 9.3% |
| o | 1617618 | 9.1% |
| e | 1383725 | 7.8% |
| s | 1315600 | 7.4% |
| r | 1283428 | 7.2% |
| l | 1102712 | 6.2% |
| u | 1023714 | 5.7% |
| n | 983410 | 5.5% |
| t | 942000 | 5.3% |
| Other values (18) | 4303698 |
Uppercase Letter
| Value | Count | Frequency (%) |
| P | 334484 | |
| C | 299923 | |
| A | 207393 | |
| S | 200915 | 9.0% |
| M | 150104 | 6.7% |
| L | 120892 | 5.4% |
| E | 116992 | 5.2% |
| T | 108267 | 4.8% |
| D | 102239 | 4.6% |
| B | 94126 | 4.2% |
| Other values (16) | 505400 |
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 10 | |
| 1 | 8 | |
| 4 | 6 | |
| 0 | 4 | 11.8% |
| 8 | 2 | 5.9% |
| 3 | 2 | 5.9% |
| 6 | 2 | 5.9% |
Other Punctuation
| Value | Count | Frequency (%) |
| : | 4 | |
| . | 3 | |
| ? | 1 | 12.5% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 30 |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 2 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 20060270 | |
| Common | 74 | < 0.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 2206605 | 11.0% |
| i | 1657025 | 8.3% |
| o | 1617618 | 8.1% |
| e | 1383725 | 6.9% |
| s | 1315600 | 6.6% |
| r | 1283428 | 6.4% |
| l | 1102712 | 5.5% |
| u | 1023714 | 5.1% |
| n | 983410 | 4.9% |
| t | 942000 | 4.7% |
| Other values (44) | 6544433 |
Common
| Value | Count | Frequency (%) |
| - | 30 | |
| 2 | 10 | 13.5% |
| 1 | 8 | 10.8% |
| 4 | 6 | 8.1% |
| 0 | 4 | 5.4% |
| : | 4 | 5.4% |
| . | 3 | 4.1% |
| 8 | 2 | 2.7% |
| 3 | 2 | 2.7% |
| 6 | 2 | 2.7% |
| Other values (2) | 3 | 4.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 20060323 | |
| None | 21 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 2206605 | 11.0% |
| i | 1657025 | 8.3% |
| o | 1617618 | 8.1% |
| e | 1383725 | 6.9% |
| s | 1315600 | 6.6% |
| r | 1283428 | 6.4% |
| l | 1102712 | 5.5% |
| u | 1023714 | 5.1% |
| n | 983410 | 4.9% |
| t | 942000 | 4.7% |
| Other values (54) | 6544486 |
None
| Value | Count | Frequency (%) |
| ë | 20 | |
| ö | 1 | 4.8% |
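The category tables for genericName (Lowercase Letter, Uppercase Letter, Decimal Number, and so on) correspond to Unicode general categories. The following is a minimal sketch of how such a tally could be reproduced with Python's standard `unicodedata` module. The function name `category_counts` and the `CATEGORY_NAMES` mapping are illustrative assumptions, not the profiling tool's own code, and the sample list reuses the five sample rows shown for genericName.

```python
import unicodedata
from collections import Counter

# Readable labels for the general-category codes that appear in this report.
# The mapping is written by hand for illustration; unknown codes fall back
# to the raw two-letter code.
CATEGORY_NAMES = {
    "Ll": "Lowercase Letter",
    "Lu": "Uppercase Letter",
    "Nd": "Decimal Number",
    "Pd": "Dash Punctuation",
    "Po": "Other Punctuation",
    "Pc": "Connector Punctuation",
    "Ps": "Open Punctuation",
    "Pe": "Close Punctuation",
    "Zs": "Space Separator",
}


def category_counts(values):
    """Count characters per Unicode general category, skipping missing values."""
    counts = Counter()
    for value in values:
        if value is None:
            continue
        for ch in value:
            code = unicodedata.category(ch)
            counts[CATEGORY_NAMES.get(code, code)] += 1
    return counts


# The five genericName sample rows plus one missing cell.
sample = ["Lesquerella", "Desmognathus", "Ninoe", "Gomphus", "Skrjabinoclava", None]
print(category_counts(sample))
```

Running the same function over a whole column should give totals comparable to the "Most occurring categories" table, assuming missing cells are skipped as they are here.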
subgenus
Text
Missing 
| Distinct | 2 |
|---|---|
| Distinct (%) | 66.7% |
| Missing | 2361470 |
| Missing (%) | > 99.9% |
| Memory size | 18.0 MiB |
Length
| Max length | 5 |
|---|---|
| Median length | 4 |
| Mean length | 4.333333333 |
| Min length | 4 |
Unique
| Unique | 1 |
|---|---|
| Unique (%) | 33.3% |
Sample
| 1st row | true |
|---|---|
| 2nd row | false |
| 3rd row | true |
| Value | Count | Frequency (%) |
| true | 2 | |
| false | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 3 | |
| t | 2 | |
| r | 2 | |
| u | 2 | |
| f | 1 | 7.7% |
| a | 1 | 7.7% |
| l | 1 | 7.7% |
| s | 1 | 7.7% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 13 |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 3 | |
| t | 2 | |
| r | 2 | |
| u | 2 | |
| f | 1 | 7.7% |
| a | 1 | 7.7% |
| l | 1 | 7.7% |
| s | 1 | 7.7% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 13 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 3 | |
| t | 2 | |
| r | 2 | |
| u | 2 | |
| f | 1 | 7.7% |
| a | 1 | 7.7% |
| l | 1 | 7.7% |
| s | 1 | 7.7% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 13 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 3 | |
| t | 2 | |
| r | 2 | |
| u | 2 | |
| f | 1 | 7.7% |
| a | 1 | 7.7% |
| l | 1 | 7.7% |
| s | 1 | 7.7% |
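The subgenus summary is small enough to check by hand: three non-missing values (true, false, true) give 2 distinct values (66.7%), 1 value that occurs exactly once (33.3%), and lengths 4, 5, 4. The sketch below recomputes those figures. It assumes the percentages are taken relative to the non-missing count, which is inferred from the numbers in the report rather than from the tool's documentation; `summarize` is a hypothetical helper name, and it expects at least one non-missing value.

```python
from collections import Counter
from statistics import mean, median


def summarize(values):
    """Recompute distinct/unique/length statistics for one text column.

    Percentages are relative to the non-missing count (an assumption
    inferred from the report's figures).
    """
    present = [v for v in values if v is not None]
    freq = Counter(present)
    lengths = [len(v) for v in present]
    n = len(present)
    distinct = len(freq)
    # "Unique" here means values that occur exactly once.
    unique = sum(1 for count in freq.values() if count == 1)
    return {
        "distinct": distinct,
        "distinct_pct": round(100 * distinct / n, 1),
        "unique": unique,
        "unique_pct": round(100 * unique / n, 1),
        "missing": len(values) - n,
        "max_length": max(lengths),
        "median_length": median(lengths),
        "mean_length": mean(lengths),
        "min_length": min(lengths),
    }


# The three subgenus sample rows, padded with a few missing cells.
rows = ["true", "false", "true"] + [None] * 5
stats = summarize(rows)
print(stats)
```

With the three sample rows this reproduces the reported distinct and unique percentages and the length statistics; the missing count differs only because the sketch pads with five empty cells instead of the full 2361470.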
Missing 
| Distinct | 2 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 2361471 |
| Missing (%) | > 99.9% |
| Memory size | 18.0 MiB |
Length
| Max length | 36 |
|---|---|
| Median length | 21.5 |
| Mean length | 21.5 |
| Min length | 7 |
Unique
| Unique | 2 |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | 5410907 |
|---|---|
| 2nd row | 821cc27a-e3bb-4bc5-ac34-89ada245069d |
| Value | Count | Frequency (%) |
| 5410907 | 1 | |
| 821cc27a-e3bb-4bc5-ac34-89ada245069d | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| 4 | 4 | 9.3% |
| c | 4 | 9.3% |
| a | 4 | 9.3% |
| - | 4 | 9.3% |
| 5 | 3 | 7.0% |
| 0 | 3 | 7.0% |
| 9 | 3 | 7.0% |
| 2 | 3 | 7.0% |
| b | 3 | 7.0% |
| 1 | 2 | 4.7% |
| Other values (6) | 10 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 25 | |
| Lowercase Letter | 14 | |
| Dash Punctuation | 4 | 9.3% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 4 | 4 | |
| 5 | 3 | |
| 0 | 3 | |
| 9 | 3 | |
| 2 | 3 | |
| 1 | 2 | |
| 7 | 2 | |
| 8 | 2 | |
| 3 | 2 | |
| 6 | 1 | 4.0% |
Lowercase Letter
| Value | Count | Frequency (%) |
| c | 4 | |
| a | 4 | |
| b | 3 | |
| d | 2 | |
| e | 1 | 7.1% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 4 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 29 | |
| Latin | 14 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 4 | 4 | |
| - | 4 | |
| 5 | 3 | |
| 0 | 3 | |
| 9 | 3 | |
| 2 | 3 | |
| 1 | 2 | |
| 7 | 2 | |
| 8 | 2 | |
| 3 | 2 |
Latin
| Value | Count | Frequency (%) |
| c | 4 | |
| a | 4 | |
| b | 3 | |
| d | 2 | |
| e | 1 | 7.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 43 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 4 | 4 | 9.3% |
| c | 4 | 9.3% |
| a | 4 | 9.3% |
| - | 4 | 9.3% |
| 5 | 3 | 7.0% |
| 0 | 3 | 7.0% |
| 9 | 3 | 7.0% |
| 2 | 3 | 7.0% |
| b | 3 | 7.0% |
| 1 | 2 | 4.7% |
| Other values (6) | 10 |
specificEpithet
Text
Missing 
| Distinct | 101231 |
|---|---|
| Distinct (%) | 4.9% |
| Missing | 306545 |
| Missing (%) | 13.0% |
| Memory size | 18.0 MiB |
Length
| Max length | 25 |
|---|---|
| Median length | 20 |
| Mean length | 8.923929208 |
| Min length | 2 |
Unique
| Unique | 40686 |
|---|---|
| Unique (%) | 2.0% |
Sample
| 1st row | lescurii |
|---|---|
| 2nd row | ochrophaeus |
| 3rd row | kinbergi |
| 4th row | adelphus |
| 5th row | couchii |
| Value | Count | Frequency (%) |
| cinereus | 20993 | 1.0% |
| americana | 5520 | 0.3% |
| gracilis | 5231 | 0.3% |
| canadensis | 4690 | 0.2% |
| maniculatus | 4077 | 0.2% |
| fuscus | 4025 | 0.2% |
| occidentalis | 3909 | 0.2% |
| montanus | 3857 | 0.2% |
| elegans | 3772 | 0.2% |
| carolinensis | 3302 | 0.2% |
| Other values (101221) | 1995552 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 2365613 | |
| i | 2078357 | |
| s | 1544823 | 8.4% |
| e | 1334633 | 7.3% |
| r | 1236799 | 6.7% |
| u | 1199119 | 6.5% |
| n | 1159867 | 6.3% |
| l | 1147225 | 6.3% |
| t | 1010250 | 5.5% |
| o | 1001337 | 5.5% |
| Other values (31) | 4260009 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 18333151 | |
| Dash Punctuation | 4870 | < 0.1% |
| Decimal Number | 9 | < 0.1% |
| Uppercase Letter | 2 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 2365613 | |
| i | 2078357 | |
| s | 1544823 | 8.4% |
| e | 1334633 | 7.3% |
| r | 1236799 | 6.7% |
| u | 1199119 | 6.5% |
| n | 1159867 | 6.3% |
| l | 1147225 | 6.3% |
| t | 1010250 | 5.5% |
| o | 1001337 | 5.5% |
| Other values (21) | 4255128 |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 2 | |
| 0 | 2 | |
| 3 | 1 | |
| 5 | 1 | |
| 4 | 1 | |
| 9 | 1 | |
| 7 | 1 |
Uppercase Letter
| Value | Count | Frequency (%) |
| U | 1 | |
| S | 1 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 4870 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 18333153 | |
| Common | 4879 | < 0.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 2365613 | |
| i | 2078357 | |
| s | 1544823 | 8.4% |
| e | 1334633 | 7.3% |
| r | 1236799 | 6.7% |
| u | 1199119 | 6.5% |
| n | 1159867 | 6.3% |
| l | 1147225 | 6.3% |
| t | 1010250 | 5.5% |
| o | 1001337 | 5.5% |
| Other values (23) | 4255130 |
Common
| Value | Count | Frequency (%) |
| - | 4870 | |
| 1 | 2 | < 0.1% |
| 0 | 2 | < 0.1% |
| 3 | 1 | < 0.1% |
| 5 | 1 | < 0.1% |
| 4 | 1 | < 0.1% |
| 9 | 1 | < 0.1% |
| 7 | 1 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 18337869 | |
| None | 163 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 2365613 | |
| i | 2078357 | |
| s | 1544823 | 8.4% |
| e | 1334633 | 7.3% |
| r | 1236799 | 6.7% |
| u | 1199119 | 6.5% |
| n | 1159867 | 6.3% |
| l | 1147225 | 6.3% |
| t | 1010250 | 5.5% |
| o | 1001337 | 5.5% |
| Other values (26) | 4259846 |
None
| Value | Count | Frequency (%) |
| ü | 95 | |
| ö | 31 | 19.0% |
| ï | 18 | 11.0% |
| ë | 18 | 11.0% |
| ä | 1 | 0.6% |
Missing 
| Distinct | 16294 |
|---|---|
| Distinct (%) | 7.3% |
| Missing | 2138642 |
| Missing (%) | 90.6% |
| Memory size | 18.0 MiB |
Length
| Max length | 25 |
|---|---|
| Median length | 19 |
| Mean length | 8.952681629 |
| Min length | 1 |
Unique
| Unique | 5417 |
|---|---|
| Unique (%) | 2.4% |
Sample
| 1st row | cinnamomina |
|---|---|
| 2nd row | berlandieri |
| 3rd row | mellodora |
| 4th row | rubiginosa |
| 5th row | spergulariiforme |
| Value | Count | Frequency (%) |
| domesticus | 1270 | 0.6% |
| acuminatum | 1170 | 0.5% |
| pennsylvanicus | 1114 | 0.5% |
| cinereus | 977 | 0.4% |
| talpoides | 972 | 0.4% |
| carolinensis | 825 | 0.4% |
| occidentalis | 737 | 0.3% |
| mexicana | 726 | 0.3% |
| major | 669 | 0.3% |
| borealis | 646 | 0.3% |
| Other values (16284) | 213725 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 236092 | |
| i | 231855 | |
| s | 193161 | |
| e | 150019 | 7.5% |
| n | 135896 | 6.8% |
| r | 129914 | 6.5% |
| u | 129816 | 6.5% |
| l | 121955 | 6.1% |
| o | 110496 | 5.5% |
| c | 101725 | 5.1% |
| Other values (30) | 454006 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 1994754 | |
| Dash Punctuation | 158 | < 0.1% |
| Decimal Number | 18 | < 0.1% |
| Other Punctuation | 3 | < 0.1% |
| Uppercase Letter | 2 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 236092 | |
| i | 231855 | |
| s | 193161 | |
| e | 150019 | 7.5% |
| n | 135896 | 6.8% |
| r | 129914 | 6.5% |
| u | 129816 | 6.5% |
| l | 121955 | 6.1% |
| o | 110496 | 5.5% |
| c | 101725 | 5.1% |
| Other values (17) | 453825 |
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 4 | |
| 0 | 4 | |
| 1 | 4 | |
| 6 | 2 | |
| 4 | 1 | 5.6% |
| 3 | 1 | 5.6% |
| 5 | 1 | 5.6% |
| 9 | 1 | 5.6% |
Other Punctuation
| Value | Count | Frequency (%) |
| : | 2 | |
| . | 1 |
Uppercase Letter
| Value | Count | Frequency (%) |
| T | 1 | |
| Z | 1 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 158 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1994756 | |
| Common | 179 | < 0.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 236092 | |
| i | 231855 | |
| s | 193161 | |
| e | 150019 | 7.5% |
| n | 135896 | 6.8% |
| r | 129914 | 6.5% |
| u | 129816 | 6.5% |
| l | 121955 | 6.1% |
| o | 110496 | 5.5% |
| c | 101725 | 5.1% |
| Other values (19) | 453827 |
Common
| Value | Count | Frequency (%) |
| - | 158 | |
| 2 | 4 | 2.2% |
| 0 | 4 | 2.2% |
| 1 | 4 | 2.2% |
| : | 2 | 1.1% |
| 6 | 2 | 1.1% |
| 4 | 1 | 0.6% |
| 3 | 1 | 0.6% |
| 5 | 1 | 0.6% |
| 9 | 1 | 0.6% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1994927 | |
| None | 8 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 236092 | |
| i | 231855 | |
| s | 193161 | |
| e | 150019 | 7.5% |
| n | 135896 | 6.8% |
| r | 129914 | 6.5% |
| u | 129816 | 6.5% |
| l | 121955 | 6.1% |
| o | 110496 | 5.5% |
| c | 101725 | 5.1% |
| Other values (29) | 453998 |
None
| Value | Count | Frequency (%) |
| ö | 8 |
cultivarEpithet
Text
Missing 
| Distinct | 3 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 2361470 |
| Missing (%) | > 99.9% |
| Memory size | 18.0 MiB |
Length
| Max length | 13 |
|---|---|
| Median length | 7 |
| Mean length | 9 |
| Min length | 7 |
Unique
| Unique | 3 |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | OCEANIA |
|---|---|
| 2nd row | 7707728 |
| 3rd row | LATIN_AMERICA |
| Value | Count | Frequency (%) |
| oceania | 1 | |
| 7707728 | 1 | |
| latin_america | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| A | 5 | |
| 7 | 4 | |
| I | 3 | |
| C | 2 | 7.4% |
| E | 2 | 7.4% |
| N | 2 | 7.4% |
| O | 1 | 3.7% |
| 0 | 1 | 3.7% |
| 2 | 1 | 3.7% |
| 8 | 1 | 3.7% |
| Other values (5) | 5 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 19 | |
| Decimal Number | 7 | 25.9% |
| Connector Punctuation | 1 | 3.7% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 5 | |
| I | 3 | |
| C | 2 | 10.5% |
| E | 2 | 10.5% |
| N | 2 | 10.5% |
| O | 1 | 5.3% |
| L | 1 | 5.3% |
| T | 1 | 5.3% |
| M | 1 | 5.3% |
| R | 1 | 5.3% |
Decimal Number
| Value | Count | Frequency (%) |
| 7 | 4 | |
| 0 | 1 | 14.3% |
| 2 | 1 | 14.3% |
| 8 | 1 | 14.3% |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 19 | |
| Common | 8 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| A | 5 | |
| I | 3 | |
| C | 2 | 10.5% |
| E | 2 | 10.5% |
| N | 2 | 10.5% |
| O | 1 | 5.3% |
| L | 1 | 5.3% |
| T | 1 | 5.3% |
| M | 1 | 5.3% |
| R | 1 | 5.3% |
Common
| Value | Count | Frequency (%) |
| 7 | 4 | |
| 0 | 1 | 12.5% |
| 2 | 1 | 12.5% |
| 8 | 1 | 12.5% |
| _ | 1 | 12.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 27 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| A | 5 | |
| 7 | 4 | |
| I | 3 | |
| C | 2 | 7.4% |
| E | 2 | 7.4% |
| N | 2 | 7.4% |
| O | 1 | 3.7% |
| 0 | 1 | 3.7% |
| 2 | 1 | 3.7% |
| 8 | 1 | 3.7% |
| Other values (5) | 5 |
taxonRank
Text
| Distinct | 13 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 10 |
| Missing (%) | < 0.1% |
| Memory size | 18.0 MiB |
Length
| Max length | 13 |
|---|---|
| Median length | 7 |
| Mean length | 6.997910194 |
| Min length | 3 |
Unique
| Unique | 2 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | FAMILY |
|---|---|
| 2nd row | SPECIES |
| 3rd row | SPECIES |
| 4th row | ORDER |
| 5th row | SPECIES |
| Value | Count | Frequency (%) |
| species | 1832195 | |
| genus | 185798 | 7.9% |
| subspecies | 170097 | 7.2% |
| family | 70045 | 3.0% |
| variety | 50926 | 2.2% |
| phylum | 17766 | 0.8% |
| class | 16815 | 0.7% |
| order | 8393 | 0.4% |
| kingdom | 7620 | 0.3% |
| form | 1804 | 0.1% |
| Other values (3) | 4 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| S | 4394109 | |
| E | 4249704 | |
| I | 2130885 | |
| P | 2020058 | |
| C | 2019109 | |
| U | 373662 | 2.3% |
| N | 193422 | 1.2% |
| G | 193418 | 1.2% |
| B | 170097 | 1.0% |
| Y | 138737 | 0.8% |
| Other values (14) | 642105 | 3.9% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 16525301 | |
| Decimal Number | 3 | < 0.1% |
| Connector Punctuation | 2 | < 0.1% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 4394109 | |
| E | 4249704 | |
| I | 2130885 | |
| P | 2020058 | |
| C | 2019109 | |
| U | 373662 | 2.3% |
| N | 193422 | 1.2% |
| G | 193418 | 1.2% |
| B | 170097 | 1.0% |
| Y | 138737 | 0.8% |
| Other values (11) | 642100 | 3.9% |
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 2 | |
| 0 | 1 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 2 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 16525301 | |
| Common | 5 | < 0.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| S | 4394109 | |
| E | 4249704 | |
| I | 2130885 | |
| P | 2020058 | |
| C | 2019109 | |
| U | 373662 | 2.3% |
| N | 193422 | 1.2% |
| G | 193418 | 1.2% |
| B | 170097 | 1.0% |
| Y | 138737 | 0.8% |
| Other values (11) | 642100 | 3.9% |
Common
| Value | Count | Frequency (%) |
| _ | 2 | |
| 2 | 2 | |
| 0 | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 16525306 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| S | 4394109 | |
| E | 4249704 | |
| I | 2130885 | |
| P | 2020058 | |
| C | 2019109 | |
| U | 373662 | 2.3% |
| N | 193422 | 1.2% |
| G | 193418 | 1.2% |
| B | 170097 | 1.0% |
| Y | 138737 | 0.8% |
| Other values (14) | 642105 | 3.9% |
Missing 
| Distinct | 3 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 2361470 |
| Missing (%) | > 99.9% |
| Memory size | 18.0 MiB |
Length
| Max length | 51 |
|---|---|
| Median length | 3 |
| Mean length | 19 |
| Min length | 3 |
Unique
| Unique | 3 |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | AUS |
|---|---|
| 2nd row | 414 |
| 3rd row | Plantae, Dicotyledonae (basal), Laurales, Lauraceae |
| Value | Count | Frequency (%) |
| aus | 1 | |
| 414 | 1 | |
| plantae | 1 | |
| dicotyledonae | 1 | |
| basal | 1 | |
| laurales | 1 | |
| lauraceae | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 10 | |
| e | 6 | 10.5% |
| (space) | 4 | 7.0% |
| l | 4 | 7.0% |
| , | 3 | 5.3% |
| r | 2 | 3.5% |
| o | 2 | 3.5% |
| c | 2 | 3.5% |
| s | 2 | 3.5% |
| t | 2 | 3.5% |
| Other values (16) | 20 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 38 | |
| Uppercase Letter | 7 | 12.3% |
| Space Separator | 4 | 7.0% |
| Other Punctuation | 3 | 5.3% |
| Decimal Number | 3 | 5.3% |
| Close Punctuation | 1 | 1.8% |
| Open Punctuation | 1 | 1.8% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 10 | |
| e | 6 | |
| l | 4 | 10.5% |
| r | 2 | 5.3% |
| o | 2 | 5.3% |
| c | 2 | 5.3% |
| s | 2 | 5.3% |
| t | 2 | 5.3% |
| n | 2 | 5.3% |
| u | 2 | 5.3% |
| Other values (4) | 4 | 10.5% |
Uppercase Letter
| Value | Count | Frequency (%) |
| L | 2 | |
| A | 1 | |
| U | 1 | |
| P | 1 | |
| S | 1 | |
| D | 1 |
Decimal Number
| Value | Count | Frequency (%) |
| 4 | 2 | |
| 1 | 1 |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 4 |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 3 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 1 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 45 | |
| Common | 12 | 21.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 10 | |
| e | 6 | |
| l | 4 | 8.9% |
| r | 2 | 4.4% |
| o | 2 | 4.4% |
| c | 2 | 4.4% |
| s | 2 | 4.4% |
| t | 2 | 4.4% |
| n | 2 | 4.4% |
| L | 2 | 4.4% |
| Other values (10) | 11 |
Common
| Value | Count | Frequency (%) |
| (space) | 4 | |
| , | 3 | |
| 4 | 2 | |
| ) | 1 | 8.3% |
| ( | 1 | 8.3% |
| 1 | 1 | 8.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 57 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 10 | |
| e | 6 | 10.5% |
| (space) | 4 | 7.0% |
| l | 4 | 7.0% |
| , | 3 | 5.3% |
| r | 2 | 3.5% |
| o | 2 | 3.5% |
| c | 2 | 3.5% |
| s | 2 | 3.5% |
| t | 2 | 3.5% |
| Other values (16) | 20 |
vernacularName
Text
Missing 
| Distinct | 4 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 2361469 |
| Missing (%) | > 99.9% |
| Memory size | 18.0 MiB |
Length
| Max length | 9 |
|---|---|
| Median length | 7.5 |
| Mean length | 7 |
| Min length | 4 |
Unique
| Unique | 4 |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | Australia |
|---|---|
| 2nd row | 8801 |
| 3rd row | HOLOTYPE |
| 4th row | Plantae |
| Value | Count | Frequency (%) |
| australia | 1 | |
| 8801 | 1 | |
| holotype | 1 | |
| plantae | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 4 | 14.3% |
| O | 2 | 7.1% |
| t | 2 | 7.1% |
| l | 2 | 7.1% |
| P | 2 | 7.1% |
| 8 | 2 | 7.1% |
| A | 1 | 3.6% |
| n | 1 | 3.6% |
| E | 1 | 3.6% |
| Y | 1 | 3.6% |
| Other values (10) | 10 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 14 | |
| Uppercase Letter | 10 | |
| Decimal Number | 4 | 14.3% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 4 | |
| t | 2 | |
| l | 2 | |
| n | 1 | 7.1% |
| u | 1 | 7.1% |
| i | 1 | 7.1% |
| r | 1 | 7.1% |
| s | 1 | 7.1% |
| e | 1 | 7.1% |
Uppercase Letter
| Value | Count | Frequency (%) |
| O | 2 | |
| P | 2 | |
| A | 1 | |
| E | 1 | |
| Y | 1 | |
| T | 1 | |
| L | 1 | |
| H | 1 |
Decimal Number
| Value | Count | Frequency (%) |
| 8 | 2 | |
| 1 | 1 | |
| 0 | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 24 | |
| Common | 4 | 14.3% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 4 | |
| O | 2 | 8.3% |
| t | 2 | 8.3% |
| l | 2 | 8.3% |
| P | 2 | 8.3% |
| A | 1 | 4.2% |
| n | 1 | 4.2% |
| E | 1 | 4.2% |
| Y | 1 | 4.2% |
| T | 1 | 4.2% |
| Other values (7) | 7 |
Common
| Value | Count | Frequency (%) |
| 8 | 2 | |
| 1 | 1 | |
| 0 | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 28 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 4 | 14.3% |
| O | 2 | 7.1% |
| t | 2 | 7.1% |
| l | 2 | 7.1% |
| P | 2 | 7.1% |
| 8 | 2 | 7.1% |
| A | 1 | 3.6% |
| n | 1 | 3.6% |
| E | 1 | 3.6% |
| Y | 1 | 3.6% |
| Other values (10) | 10 |
Missing 
| Distinct | 5 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 2361468 |
| Missing (%) | > 99.9% |
| Memory size | 18.0 MiB |
Length
| Max length | 16 |
|---|---|
| Median length | 15 |
| Mean length | 11.4 |
| Min length | 7 |
Unique
| Unique | 5 |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | AUS.6_1 |
|---|---|
| 2nd row | Howell, Tiffany |
| 3rd row | 3161935 |
| 4th row | Maccallum, G. A. |
| 5th row | Tracheophyta |
| Value | Count | Frequency (%) |
| aus.6_1 | 1 | |
| howell | 1 | |
| tiffany | 1 | |
| 3161935 | 1 | |
| maccallum | 1 | |
| g | 1 | |
| a | 1 | |
| tracheophyta | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 5 | 8.8% |
| l | 4 | 7.0% |
| . | 3 | 5.3% |
| c | 3 | 5.3% |
| 1 | 3 | 5.3% |
| (space) | 3 | 5.3% |
| A | 2 | 3.5% |
| , | 2 | 3.5% |
| h | 2 | 3.5% |
| 3 | 2 | 3.5% |
| Other values (22) | 28 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 30 | |
| Decimal Number | 9 | 15.8% |
| Uppercase Letter | 9 | 15.8% |
| Other Punctuation | 5 | 8.8% |
| Space Separator | 3 | 5.3% |
| Connector Punctuation | 1 | 1.8% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 5 | |
| l | 4 | |
| c | 3 | |
| h | 2 | 6.7% |
| y | 2 | 6.7% |
| f | 2 | 6.7% |
| e | 2 | 6.7% |
| o | 2 | 6.7% |
| p | 1 | 3.3% |
| r | 1 | 3.3% |
| Other values (6) | 6 |
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 2 | |
| T | 2 | |
| M | 1 | |
| S | 1 | |
| G | 1 | |
| H | 1 | |
| U | 1 |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 3 | |
| 3 | 2 | |
| 6 | 2 | |
| 5 | 1 | 11.1% |
| 9 | 1 | 11.1% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 3 | |
| , | 2 |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 3 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 39 | |
| Common | 18 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 5 | 12.8% |
| l | 4 | 10.3% |
| c | 3 | 7.7% |
| A | 2 | 5.1% |
| h | 2 | 5.1% |
| y | 2 | 5.1% |
| T | 2 | 5.1% |
| f | 2 | 5.1% |
| e | 2 | 5.1% |
| o | 2 | 5.1% |
| Other values (13) | 13 |
Common
| Value | Count | Frequency (%) |
| . | 3 | |
| 1 | 3 | |
| (space) | 3 | |
| , | 2 | |
| 3 | 2 | |
| 6 | 2 | |
| 5 | 1 | 5.6% |
| 9 | 1 | 5.6% |
| _ | 1 | 5.6% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 57 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 5 | 8.8% |
| l | 4 | 7.0% |
| . | 3 | 5.3% |
| c | 3 | 5.3% |
| 1 | 3 | 5.3% |
| (space) | 3 | 5.3% |
| A | 2 | 3.5% |
| , | 2 | 3.5% |
| h | 2 | 3.5% |
| 3 | 2 | 3.5% |
| Other values (22) | 28 |
taxonomicStatus
Text
| Distinct | 6 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 5772 |
| Missing (%) | 0.2% |
| Memory size | 18.0 MiB |
Length
| Max length | 77 |
|---|---|
| Median length | 8 |
| Mean length | 7.830022146 |
| Min length | 7 |
Unique
| Unique | 3 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | ACCEPTED |
|---|---|
| 2nd row | SYNONYM |
| 3rd row | ACCEPTED |
| 4th row | ACCEPTED |
| 5th row | ACCEPTED |
| Value | Count | Frequency (%) |
| accepted | 1936163 | |
| synonym | 400501 | 17.0% |
| doubtful | 19034 | 0.8% |
| northern | 1 | < 0.1% |
| territory | 1 | < 0.1% |
| occurrence_status_inferred_from_individual_count;geodetic_datum_assumed_wgs84 | 1 | < 0.1% |
| magnoliopsida | 1 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| E | 3872333 | |
| C | 3872331 | |
| T | 1955203 | |
| D | 1955203 | |
| A | 1936167 | |
| P | 1936163 | |
| N | 801007 | 4.3% |
| Y | 801002 | 4.3% |
| O | 419539 | 2.3% |
| S | 400506 | 2.2% |
| Other values (29) | 495737 | 2.7% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 18445152 | |
| Lowercase Letter | 27 | < 0.1% |
| Connector Punctuation | 8 | < 0.1% |
| Decimal Number | 2 | < 0.1% |
| Space Separator | 1 | < 0.1% |
| Other Punctuation | 1 | < 0.1% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| E | 3872333 | |
| C | 3872331 | |
| T | 1955203 | |
| D | 1955203 | |
| A | 1936167 | |
| P | 1936163 | |
| N | 801007 | 4.3% |
| Y | 801002 | 4.3% |
| O | 419539 | 2.3% |
| S | 400506 | 2.2% |
| Other values (10) | 495698 | 2.7% |
Lowercase Letter
| Value | Count | Frequency (%) |
| r | 5 | |
| o | 4 | |
| i | 3 | |
| a | 2 | 7.4% |
| e | 2 | 7.4% |
| n | 2 | 7.4% |
| t | 2 | 7.4% |
| y | 1 | 3.7% |
| h | 1 | 3.7% |
| g | 1 | 3.7% |
| Other values (4) | 4 |
Decimal Number
| Value | Count | Frequency (%) |
| 8 | 1 | |
| 4 | 1 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 8 |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 1 |
Other Punctuation
| Value | Count | Frequency (%) |
| ; | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 18445179 | |
| Common | 12 | < 0.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| E | 3872333 | |
| C | 3872331 | |
| T | 1955203 | |
| D | 1955203 | |
| A | 1936167 | |
| P | 1936163 | |
| N | 801007 | 4.3% |
| Y | 801002 | 4.3% |
| O | 419539 | 2.3% |
| S | 400506 | 2.2% |
| Other values (24) | 495725 | 2.7% |
Common
| Value | Count | Frequency (%) |
| _ | 8 | |
| (space) | 1 | 8.3% |
| ; | 1 | 8.3% |
| 8 | 1 | 8.3% |
| 4 | 1 | 8.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 18445191 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| E | 3872333 | |
| C | 3872331 | |
| T | 1955203 | |
| D | 1955203 | |
| A | 1936167 | |
| P | 1936163 | |
| N | 801007 | 4.3% |
| Y | 801002 | 4.3% |
| O | 419539 | 2.3% |
| S | 400506 | 2.2% |
| Other values (29) | 495737 | 2.7% |
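The taxonomicStatus value table is dominated by three values (accepted, synonym, doubtful), with a handful of single-occurrence entries such as "northern", "territory" and "magnoliopsida" that look like they belong to other columns. A minimal sketch of how such strays could be flagged is shown below. The set `EXPECTED_STATUS` is an assumption based only on the frequent values in this report, not an authoritative vocabulary, and `unexpected_values` is a hypothetical helper name.

```python
from collections import Counter

# Assumed vocabulary: the three values that dominate the taxonomicStatus
# table in this report. A real check should use the publisher's full list.
EXPECTED_STATUS = {"ACCEPTED", "SYNONYM", "DOUBTFUL"}


def unexpected_values(values, expected=EXPECTED_STATUS):
    """Return {value: count} for non-missing values outside the expected set."""
    counts = Counter(v for v in values if v is not None)
    return {v: c for v, c in counts.items() if v not in expected}


# Small illustrative sample; the stray strings mirror the rare entries above.
sample = ["ACCEPTED", "SYNONYM", "ACCEPTED", "Northern Territory", "Magnoliopsida", None]
strays = unexpected_values(sample)
print(strays)
```

Rows flagged this way are candidates for a closer look at the source file, for example to check whether a delimiter or quoting problem shifted values into the wrong column.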
Missing 
| Distinct | 4 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 2361469 |
| Missing (%) | > 99.9% |
| Memory size | 18.0 MiB |
Length
| Max length | 10 |
|---|---|
| Median length | 9 |
| Mean length | 8.75 |
| Min length | 7 |
Unique
| Unique | 4 |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | AUS.6.12_1 |
|---|---|
| 2nd row | 5410907 |
| 3rd row | StillImage |
| 4th row | Laurales |
| Value | Count | Frequency (%) |
| aus.6.12_1 | 1 | |
| 5410907 | 1 | |
| stillimage | 1 | |
| laurales | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| l | 3 | 8.6% |
| 1 | 3 | 8.6% |
| a | 3 | 8.6% |
| S | 2 | 5.7% |
| . | 2 | 5.7% |
| e | 2 | 5.7% |
| 0 | 2 | 5.7% |
| A | 1 | 2.9% |
| r | 1 | 2.9% |
| u | 1 | 2.9% |
| Other values (15) | 15 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 15 | |
| Decimal Number | 11 | |
| Uppercase Letter | 6 | 17.1% |
| Other Punctuation | 2 | 5.7% |
| Connector Punctuation | 1 | 2.9% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| l | 3 | |
| a | 3 | |
| e | 2 | |
| r | 1 | 6.7% |
| u | 1 | 6.7% |
| g | 1 | 6.7% |
| m | 1 | 6.7% |
| i | 1 | 6.7% |
| t | 1 | 6.7% |
| s | 1 | 6.7% |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 3 | |
| 0 | 2 | |
| 7 | 1 | 9.1% |
| 9 | 1 | 9.1% |
| 4 | 1 | 9.1% |
| 5 | 1 | 9.1% |
| 2 | 1 | 9.1% |
| 6 | 1 | 9.1% |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 2 | |
| A | 1 | |
| L | 1 | |
| I | 1 | |
| U | 1 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 2 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 21 | |
| Common | 14 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| l | 3 | |
| a | 3 | |
| S | 2 | 9.5% |
| e | 2 | 9.5% |
| A | 1 | 4.8% |
| r | 1 | 4.8% |
| u | 1 | 4.8% |
| L | 1 | 4.8% |
| g | 1 | 4.8% |
| m | 1 | 4.8% |
| Other values (5) | 5 |
Common
| Value | Count | Frequency (%) |
| 1 | 3 | |
| . | 2 | |
| 0 | 2 | |
| 7 | 1 | 7.1% |
| 9 | 1 | 7.1% |
| 4 | 1 | 7.1% |
| 5 | 1 | 7.1% |
| _ | 1 | 7.1% |
| 2 | 1 | 7.1% |
| 6 | 1 | 7.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 35 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| l | 3 | 8.6% |
| 1 | 3 | 8.6% |
| a | 3 | 8.6% |
| S | 2 | 5.7% |
| . | 2 | 5.7% |
| e | 2 | 5.7% |
| 0 | 2 | 5.7% |
| A | 1 | 2.9% |
| r | 1 | 2.9% |
| u | 1 | 2.9% |
| Other values (15) | 15 |
taxonRemarks
Text
Missing 
| Distinct | 3 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 2361470 |
| Missing (%) | > 99.9% |
| Memory size | 18.0 MiB |
Length
| Max length | 22 |
|---|---|
| Median length | 10 |
| Mean length | 12 |
| Min length | 4 |
Unique
| Unique | 3 |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | Roper Gulf |
|---|---|
| 2nd row | Campanula rotundifolia |
| 3rd row | true |
| Value | Count | Frequency (%) |
| roper | 1 | |
| gulf | 1 | |
| campanula | 1 | |
| rotundifolia | 1 | |
| true | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 4 | |
| u | 4 | |
| l | 3 | 8.3% |
| o | 3 | 8.3% |
| r | 3 | 8.3% |
| t | 2 | 5.6% |
| n | 2 | 5.6% |
| f | 2 | 5.6% |
| i | 2 | 5.6% |
| (space) | 2 | 5.6% |
| Other values (7) | 9 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 31 | |
| Uppercase Letter | 3 | 8.3% |
| Space Separator | 2 | 5.6% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 4 | |
| u | 4 | |
| l | 3 | |
| o | 3 | |
| r | 3 | |
| t | 2 | |
| n | 2 | |
| f | 2 | |
| i | 2 | |
| e | 2 | |
| Other values (3) | 4 |
Uppercase Letter
| Value | Count | Frequency (%) |
| G | 1 | |
| C | 1 | |
| R | 1 |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 2 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 34 | |
| Common | 2 | 5.6% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 4 | |
| u | 4 | |
| l | 3 | |
| o | 3 | |
| r | 3 | |
| t | 2 | 5.9% |
| n | 2 | 5.9% |
| f | 2 | 5.9% |
| i | 2 | 5.9% |
| e | 2 | 5.9% |
| Other values (6) | 7 |
Common
| Value | Count | Frequency (%) |
| (space) | 2 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 36 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 4 | |
| u | 4 | |
| l | 3 | 8.3% |
| o | 3 | 8.3% |
| r | 3 | 8.3% |
| t | 2 | 5.6% |
| n | 2 | 5.6% |
| f | 2 | 5.6% |
| i | 2 | 5.6% |
| (space) | 2 | 5.6% |
| Other values (7) | 9 |
datasetKey
Text
| Distinct | 4 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 10 |
| Missing (%) | < 0.1% |
| Memory size | 18.0 MiB |
Length
| Max length | 36 |
|---|---|
| Median length | 36 |
| Mean length | 35.99997078 |
| Min length | 5 |
Unique
| Unique | 3 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | 821cc27a-e3bb-4bc5-ac34-89ada245069d |
|---|---|
| 2nd row | 821cc27a-e3bb-4bc5-ac34-89ada245069d |
| 3rd row | 821cc27a-e3bb-4bc5-ac34-89ada245069d |
| 4th row | 821cc27a-e3bb-4bc5-ac34-89ada245069d |
| 5th row | 821cc27a-e3bb-4bc5-ac34-89ada245069d |
| Value | Count | Frequency (%) |
| 821cc27a-e3bb-4bc5-ac34-89ada245069d | 2361460 | |
| campanula | 1 | < 0.1% |
| rotundifolia | 1 | < 0.1% |
| l | 1 | < 0.1% |
| false | 1 | < 0.1% |
| lauraceae | 1 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 9445848 | |
| c | 9445841 | |
| - | 9445840 | |
| 2 | 7084380 | |
| b | 7084380 | |
| 4 | 7084380 | |
| d | 4722921 | 5.6% |
| 8 | 4722920 | 5.6% |
| 3 | 4722920 | 5.6% |
| 5 | 4722920 | 5.6% |
| Other values (21) | 16530249 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 42506280 | |
| Lowercase Letter | 33060473 | |
| Dash Punctuation | 9445840 | 11.1% |
| Uppercase Letter | 3 | < 0.1% |
| Space Separator | 2 | < 0.1% |
| Other Punctuation | 1 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 9445848 | |
| c | 9445841 | |
| b | 7084380 | |
| d | 4722921 | |
| e | 2361463 | 7.1% |
| u | 3 | < 0.1% |
| l | 3 | < 0.1% |
| r | 2 | < 0.1% |
| f | 2 | < 0.1% |
| i | 2 | < 0.1% |
| Other values (6) | 8 | < 0.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 7084380 | |
| 4 | 7084380 | |
| 8 | 4722920 | |
| 3 | 4722920 | |
| 5 | 4722920 | |
| 9 | 4722920 | |
| 0 | 2361460 | 5.6% |
| 6 | 2361460 | 5.6% |
| 7 | 2361460 | 5.6% |
| 1 | 2361460 | 5.6% |
Uppercase Letter
| Value | Count | Frequency (%) |
| L | 2 | |
| C | 1 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 9445840 |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 2 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 51952123 | |
| Latin | 33060476 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 9445848 | |
| c | 9445841 | |
| b | 7084380 | |
| d | 4722921 | |
| e | 2361463 | 7.1% |
| u | 3 | < 0.1% |
| l | 3 | < 0.1% |
| r | 2 | < 0.1% |
| L | 2 | < 0.1% |
| f | 2 | < 0.1% |
| Other values (8) | 11 | < 0.1% |
Common
| Value | Count | Frequency (%) |
| - | 9445840 | |
| 2 | 7084380 | |
| 4 | 7084380 | |
| 8 | 4722920 | |
| 3 | 4722920 | |
| 5 | 4722920 | |
| 9 | 4722920 | |
| 0 | 2361460 | 4.5% |
| 6 | 2361460 | 4.5% |
| 7 | 2361460 | 4.5% |
| Other values (3) | 2361463 | 4.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 85012599 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 9445848 | |
| c | 9445841 | |
| - | 9445840 | |
| 2 | 7084380 | |
| b | 7084380 | |
| 4 | 7084380 | |
| d | 4722921 | 5.6% |
| 8 | 4722920 | 5.6% |
| 3 | 4722920 | 5.6% |
| 5 | 4722920 | 5.6% |
| Other values (21) | 16530249 |
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 11 |
| Missing (%) | < 0.1% |
| Memory size | 18.0 MiB |
Length
| Max length | 22 |
|---|---|
| Median length | 2 |
| Mean length | 2.000010587 |
| Min length | 2 |
Unique
| Unique | 2 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | US |
|---|---|
| 2nd row | US |
| 3rd row | US |
| 4th row | US |
| 5th row | US |
| Value | Count | Frequency (%) |
| us | 2361460 | |
| campanula | 1 | < 0.1% |
| rotundifolia | 1 | < 0.1% |
| 3155772 | 1 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| U | 2361460 | |
| S | 2361460 | |
| a | 4 | < 0.1% |
| i | 2 | < 0.1% |
| 7 | 2 | < 0.1% |
| 5 | 2 | < 0.1% |
| n | 2 | < 0.1% |
| u | 2 | < 0.1% |
| l | 2 | < 0.1% |
| o | 2 | < 0.1% |
| Other values (11) | 11 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 4722921 | |
| Lowercase Letter | 20 | < 0.1% |
| Decimal Number | 7 | < 0.1% |
| Space Separator | 1 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 4 | |
| i | 2 | |
| n | 2 | |
| u | 2 | |
| l | 2 | |
| o | 2 | |
| f | 1 | 5.0% |
| r | 1 | 5.0% |
| d | 1 | 5.0% |
| t | 1 | 5.0% |
| Other values (2) | 2 |
Decimal Number
| Value | Count | Frequency (%) |
| 7 | 2 | |
| 5 | 2 | |
| 1 | 1 | |
| 3 | 1 | |
| 2 | 1 |
Uppercase Letter
| Value | Count | Frequency (%) |
| U | 2361460 | |
| S | 2361460 | |
| C | 1 | < 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 4722941 | |
| Common | 8 | < 0.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| U | 2361460 | |
| S | 2361460 | |
| a | 4 | < 0.1% |
| i | 2 | < 0.1% |
| n | 2 | < 0.1% |
| u | 2 | < 0.1% |
| l | 2 | < 0.1% |
| o | 2 | < 0.1% |
| f | 1 | < 0.1% |
| r | 1 | < 0.1% |
| Other values (5) | 5 | < 0.1% |
Common
| Value | Count | Frequency (%) |
| 7 | 2 | |
| 5 | 2 | |
| 1 | 1 | |
| 3 | 1 | |
| (space) | 1 | |
| 2 | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 4722949 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| U | 2361460 | |
| S | 2361460 | |
| a | 4 | < 0.1% |
| i | 2 | < 0.1% |
| 7 | 2 | < 0.1% |
| 5 | 2 | < 0.1% |
| n | 2 | < 0.1% |
| u | 2 | < 0.1% |
| l | 2 | < 0.1% |
| o | 2 | < 0.1% |
| Other values (11) | 11 | < 0.1% |
lastInterpreted
Text
| Distinct | 210763 |
|---|---|
| Distinct (%) | 8.9% |
| Missing | 10 |
| Missing (%) | < 0.1% |
| Memory size | 18.0 MiB |
Length
| Max length | 24 |
|---|---|
| Median length | 24 |
| Mean length | 23.99606473 |
| Min length | 2 |
Unique
| Unique | 7659 |
|---|---|
| Unique (%) | 0.3% |
Sample
| 1st row | 2024-12-02T13:59:36.683Z |
|---|---|
| 2nd row | 2024-12-02T13:59:14.817Z |
| 3rd row | 2024-12-02T13:57:42.802Z |
| 4th row | 2024-12-02T13:59:13.837Z |
| 5th row | 2024-12-02T13:57:45.358Z |
| Value | Count | Frequency (%) |
| 2024-12-02t13:57:25.039z | 46 | < 0.1% |
| 2024-12-02t13:57:24.083z | 45 | < 0.1% |
| 2024-12-02t13:57:28.833z | 45 | < 0.1% |
| 2024-12-02t13:57:45.003z | 45 | < 0.1% |
| 2024-12-02t13:57:52.915z | 44 | < 0.1% |
| 2024-12-02t13:57:34.491z | 44 | < 0.1% |
| 2024-12-02t13:57:52.924z | 43 | < 0.1% |
| 2024-12-02t13:57:43.166z | 43 | < 0.1% |
| 2024-12-02t13:57:52.893z | 42 | < 0.1% |
| 2024-12-02t13:57:42.743z | 42 | < 0.1% |
| Other values (210753) | 2361024 |
Most occurring characters
| Value | Count | Frequency (%) |
| 2 | 10789974 | |
| 0 | 5989862 | |
| 1 | 5962966 | |
| - | 4722920 | |
| : | 4722920 | |
| 4 | 3794952 | 6.7% |
| 5 | 3740549 | 6.6% |
| 3 | 3738232 | 6.6% |
| Z | 2361460 | 4.2% |
| T | 2361460 | 4.2% |
| Other values (9) | 8480524 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 40137903 | |
| Other Punctuation | 7082072 | 12.5% |
| Uppercase Letter | 4722924 | 8.3% |
| Dash Punctuation | 4722920 | 8.3% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 10789974 | |
| 0 | 5989862 | |
| 1 | 5962966 | |
| 4 | 3794952 | 9.5% |
| 5 | 3740549 | 9.3% |
| 3 | 3738232 | 9.3% |
| 7 | 1827734 | 4.6% |
| 9 | 1516897 | 3.8% |
| 6 | 1417013 | 3.5% |
| 8 | 1359724 | 3.4% |
Uppercase Letter
| Value | Count | Frequency (%) |
| Z | 2361460 | |
| T | 2361460 | |
| N | 1 | < 0.1% |
| E | 1 | < 0.1% |
| L | 1 | < 0.1% |
| C | 1 | < 0.1% |
Other Punctuation
| Value | Count | Frequency (%) |
| : | 4722920 | |
| . | 2359152 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 4722920 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 51942895 | |
| Latin | 4722924 | 8.3% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 2 | 10789974 | |
| 0 | 5989862 | |
| 1 | 5962966 | |
| - | 4722920 | |
| : | 4722920 | |
| 4 | 3794952 | 7.3% |
| 5 | 3740549 | 7.2% |
| 3 | 3738232 | 7.2% |
| . | 2359152 | 4.5% |
| 7 | 1827734 | 3.5% |
| Other values (3) | 4293634 | 8.3% |
Latin
| Value | Count | Frequency (%) |
| Z | 2361460 | |
| T | 2361460 | |
| N | 1 | < 0.1% |
| E | 1 | < 0.1% |
| L | 1 | < 0.1% |
| C | 1 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 56665819 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 2 | 10789974 | |
| 0 | 5989862 | |
| 1 | 5962966 | |
| - | 4722920 | |
| : | 4722920 | |
| 4 | 3794952 | 6.7% |
| 5 | 3740549 | 6.6% |
| 3 | 3738232 | 6.6% |
| Z | 2361460 | 4.2% |
| T | 2361460 | 4.2% |
| Other values (9) | 8480524 |
elevation
Text
Missing 
| Distinct | 5275 |
|---|---|
| Distinct (%) | 1.0% |
| Missing | 1813940 |
| Missing (%) | 76.8% |
| Memory size | 18.0 MiB |
Length
| Max length | 18 |
|---|---|
| Median length | 17 |
| Mean length | 5.344459603 |
| Min length | 1 |
Unique
| Unique | 992 |
|---|---|
| Unique (%) | 0.2% |
Sample
| 1st row | 1097.5 |
|---|---|
| 2nd row | 140.0 |
| 3rd row | 2880.0 |
| 4th row | 1219.0 |
| 5th row | 1100.0 |
| Value | Count | Frequency (%) |
| 1000.0 | 7671 | 1.4% |
| 100.0 | 7194 | 1.3% |
| 200.0 | 6722 | 1.2% |
| 500.0 | 6255 | 1.1% |
| 300.0 | 6128 | 1.1% |
| 1500.0 | 5575 | 1.0% |
| 800.0 | 5408 | 1.0% |
| 900.0 | 5284 | 1.0% |
| 1200.0 | 5262 | 1.0% |
| 400.0 | 5236 | 1.0% |
| Other values (5246) | 486798 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 1005275 | |
| . | 547531 | |
| 1 | 294369 | 10.1% |
| 5 | 218789 | 7.5% |
| 2 | 216684 | 7.4% |
| 3 | 143662 | 4.9% |
| 4 | 113390 | 3.9% |
| 7 | 104322 | 3.6% |
| 6 | 101282 | 3.5% |
| 8 | 94460 | 3.2% |
| Other values (5) | 86504 | 3.0% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 2378653 | |
| Other Punctuation | 547531 | 18.7% |
| Dash Punctuation | 81 | < 0.1% |
| Uppercase Letter | 3 | < 0.1% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 1005275 | |
| 1 | 294369 | 12.4% |
| 5 | 218789 | 9.2% |
| 2 | 216684 | 9.1% |
| 3 | 143662 | 6.0% |
| 4 | 113390 | 4.8% |
| 7 | 104322 | 4.4% |
| 6 | 101282 | 4.3% |
| 8 | 94460 | 4.0% |
| 9 | 86420 | 3.6% |
Uppercase Letter
| Value | Count | Frequency (%) |
| E | 1 | |
| M | 1 | |
| L | 1 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 547531 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 81 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 2926265 | |
| Latin | 3 | < 0.1% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 1005275 | |
| . | 547531 | |
| 1 | 294369 | 10.1% |
| 5 | 218789 | 7.5% |
| 2 | 216684 | 7.4% |
| 3 | 143662 | 4.9% |
| 4 | 113390 | 3.9% |
| 7 | 104322 | 3.6% |
| 6 | 101282 | 3.5% |
| 8 | 94460 | 3.2% |
| Other values (2) | 86501 | 3.0% |
Latin
| Value | Count | Frequency (%) |
| E | 1 | |
| M | 1 | |
| L | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2926268 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 1005275 | |
| . | 547531 | |
| 1 | 294369 | 10.1% |
| 5 | 218789 | 7.5% |
| 2 | 216684 | 7.4% |
| 3 | 143662 | 4.9% |
| 4 | 113390 | 3.9% |
| 7 | 104322 | 3.6% |
| 6 | 101282 | 3.5% |
| 8 | 94460 | 3.2% |
| Other values (5) | 86504 | 3.0% |
Missing 
| Distinct | 941 |
|---|---|
| Distinct (%) | 0.5% |
| Missing | 2160162 |
| Missing (%) | 91.5% |
| Memory size | 18.0 MiB |
Length
| Max length | 24 |
|---|---|
| Median length | 3 |
| Mean length | 3.726492839 |
| Min length | 3 |
Unique
| Unique | 301 |
|---|---|
| Unique (%) | 0.1% |
Sample
| 1st row | 48.5 |
|---|---|
| 2nd row | 0.0 |
| 3rd row | 75.0 |
| 4th row | 50.0 |
| 5th row | 0.0 |
| Value | Count | Frequency (%) |
| 0.0 | 94148 | |
| 50.0 | 14946 | 7.4% |
| 100.0 | 9655 | 4.8% |
| 150.0 | 6530 | 3.2% |
| 25.0 | 6433 | 3.2% |
| 75.0 | 3885 | 1.9% |
| 200.0 | 3746 | 1.9% |
| 152.5 | 3332 | 1.7% |
| 15.0 | 2755 | 1.4% |
| 10.0 | 2349 | 1.2% |
| Other values (931) | 53532 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 345550 | |
| . | 201303 | |
| 5 | 78571 | 10.5% |
| 1 | 40957 | 5.5% |
| 2 | 31510 | 4.2% |
| 3 | 14551 | 1.9% |
| 7 | 13509 | 1.8% |
| 4 | 7670 | 1.0% |
| 6 | 7628 | 1.0% |
| 8 | 5571 | 0.7% |
| Other values (10) | 3364 | 0.4% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 548869 | |
| Other Punctuation | 201305 | 26.8% |
| Lowercase Letter | 5 | < 0.1% |
| Uppercase Letter | 3 | < 0.1% |
| Dash Punctuation | 2 | < 0.1% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 345550 | |
| 5 | 78571 | 14.3% |
| 1 | 40957 | 7.5% |
| 2 | 31510 | 5.7% |
| 3 | 14551 | 2.7% |
| 7 | 13509 | 2.5% |
| 4 | 7670 | 1.4% |
| 6 | 7628 | 1.4% |
| 8 | 5571 | 1.0% |
| 9 | 3352 | 0.6% |
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 2 | |
| r | 1 | |
| s | 1 | |
| a | 1 |
Uppercase Letter
| Value | Count | Frequency (%) |
| P | 1 | |
| T | 1 | |
| Z | 1 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 201303 | |
| : | 2 | < 0.1% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 2 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 750176 | |
| Latin | 8 | < 0.1% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 345550 | |
| . | 201303 | |
| 5 | 78571 | 10.5% |
| 1 | 40957 | 5.5% |
| 2 | 31510 | 4.2% |
| 3 | 14551 | 1.9% |
| 7 | 13509 | 1.8% |
| 4 | 7670 | 1.0% |
| 6 | 7628 | 1.0% |
| 8 | 5571 | 0.7% |
| Other values (3) | 3356 | 0.4% |
Latin
| Value | Count | Frequency (%) |
| e | 2 | |
| P | 1 | |
| r | 1 | |
| s | 1 | |
| a | 1 | |
| T | 1 | |
| Z | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 750184 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 345550 | |
| . | 201303 | |
| 5 | 78571 | 10.5% |
| 1 | 40957 | 5.5% |
| 2 | 31510 | 4.2% |
| 3 | 14551 | 1.9% |
| 7 | 13509 | 1.8% |
| 4 | 7670 | 1.0% |
| 6 | 7628 | 1.0% |
| 8 | 5571 | 0.7% |
| Other values (10) | 3364 | 0.4% |
depth
Text
Missing 
| Distinct | 6333 |
|---|---|
| Distinct (%) | 2.4% |
| Missing | 2098489 |
| Missing (%) | 88.9% |
| Memory size | 18.0 MiB |
Length
| Max length | 24 |
|---|---|
| Median length | 19 |
| Mean length | 4.326179539 |
| Min length | 3 |
Unique
| Unique | 1835 |
|---|---|
| Unique (%) | 0.7% |
Sample
| 1st row | 9.1 |
|---|---|
| 2nd row | 200.0 |
| 3rd row | 3200.0 |
| 4th row | 40.0 |
| 5th row | 824.0 |
| Value | Count | Frequency (%) |
| 0.5 | 9154 | 3.5% |
| 1.0 | 5358 | 2.0% |
| 18.0 | 4141 | 1.6% |
| 3.0 | 4066 | 1.5% |
| 1.5 | 3594 | 1.4% |
| 12.0 | 3107 | 1.2% |
| 2.0 | 2972 | 1.1% |
| 6.0 | 2906 | 1.1% |
| 15.0 | 2686 | 1.0% |
| 24.0 | 2578 | 1.0% |
| Other values (6323) | 222422 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 279518 | |
| . | 262982 | |
| 5 | 110628 | 9.7% |
| 1 | 109984 | 9.7% |
| 2 | 82032 | 7.2% |
| 3 | 61134 | 5.4% |
| 4 | 55715 | 4.9% |
| 6 | 46039 | 4.0% |
| 8 | 45736 | 4.0% |
| 7 | 43807 | 3.9% |
| Other values (10) | 40141 | 3.5% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 874722 | |
| Other Punctuation | 262984 | 23.1% |
| Lowercase Letter | 5 | < 0.1% |
| Uppercase Letter | 3 | < 0.1% |
| Dash Punctuation | 2 | < 0.1% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 279518 | |
| 5 | 110628 | 12.6% |
| 1 | 109984 | 12.6% |
| 2 | 82032 | 9.4% |
| 3 | 61134 | 7.0% |
| 4 | 55715 | 6.4% |
| 6 | 46039 | 5.3% |
| 8 | 45736 | 5.2% |
| 7 | 43807 | 5.0% |
| 9 | 40129 | 4.6% |
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 2 | |
| r | 1 | |
| s | 1 | |
| a | 1 |
Uppercase Letter
| Value | Count | Frequency (%) |
| P | 1 | |
| T | 1 | |
| Z | 1 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 262982 | |
| : | 2 | < 0.1% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 2 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 1137708 | |
| Latin | 8 | < 0.1% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 279518 | |
| . | 262982 | |
| 5 | 110628 | 9.7% |
| 1 | 109984 | 9.7% |
| 2 | 82032 | 7.2% |
| 3 | 61134 | 5.4% |
| 4 | 55715 | 4.9% |
| 6 | 46039 | 4.0% |
| 8 | 45736 | 4.0% |
| 7 | 43807 | 3.9% |
| Other values (3) | 40133 | 3.5% |
Latin
| Value | Count | Frequency (%) |
| e | 2 | |
| P | 1 | |
| r | 1 | |
| s | 1 | |
| a | 1 | |
| T | 1 | |
| Z | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1137716 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 279518 | |
| . | 262982 | |
| 5 | 110628 | 9.7% |
| 1 | 109984 | 9.7% |
| 2 | 82032 | 7.2% |
| 3 | 61134 | 5.4% |
| 4 | 55715 | 4.9% |
| 6 | 46039 | 4.0% |
| 8 | 45736 | 4.0% |
| 7 | 43807 | 3.9% |
| Other values (10) | 40141 | 3.5% |
depthAccuracy
Text
Missing 
| Distinct | 1495 |
|---|---|
| Distinct (%) | 0.6% |
| Missing | 2120420 |
| Missing (%) | 89.8% |
| Memory size | 18.0 MiB |
Length
| Max length | 21 |
|---|---|
| Median length | 3 |
| Mean length | 3.319236848 |
| Min length | 3 |
Unique
| Unique | 326 |
|---|---|
| Unique (%) | 0.1% |
Sample
| 1st row | 0.0 |
|---|---|
| 2nd row | 0.0 |
| 3rd row | 0.0 |
| 4th row | 10.0 |
| 5th row | 20.0 |
| Value | Count | Frequency (%) |
| 0.0 | 142418 | |
| 0.5 | 12237 | 5.1% |
| 3.0 | 8209 | 3.4% |
| 1.0 | 7211 | 3.0% |
| 1.5 | 6496 | 2.7% |
| 2.0 | 4403 | 1.8% |
| 2.5 | 4398 | 1.8% |
| 5.0 | 2956 | 1.2% |
| 4.5 | 2477 | 1.0% |
| 3.5 | 1687 | 0.7% |
| Other values (1485) | 48561 | 20.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 372794 | |
| . | 241051 | |
| 5 | 62907 | 7.9% |
| 1 | 31462 | 3.9% |
| 2 | 22080 | 2.8% |
| 9 | 20707 | 2.6% |
| 3 | 17061 | 2.1% |
| 4 | 11040 | 1.4% |
| 7 | 8028 | 1.0% |
| 6 | 7619 | 1.0% |
| Other values (5) | 5363 | 0.7% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 559057 | |
| Other Punctuation | 241051 | |
| Lowercase Letter | 4 | < 0.1% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 372794 | |
| 5 | 62907 | 11.3% |
| 1 | 31462 | 5.6% |
| 2 | 22080 | 3.9% |
| 9 | 20707 | 3.7% |
| 3 | 17061 | 3.1% |
| 4 | 11040 | 2.0% |
| 7 | 8028 | 1.4% |
| 6 | 7619 | 1.4% |
| 8 | 5359 | 1.0% |
Lowercase Letter
| Value | Count | Frequency (%) |
| t | 1 | |
| r | 1 | |
| u | 1 | |
| e | 1 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 241051 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 800108 | |
| Latin | 4 | < 0.1% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 372794 | |
| . | 241051 | |
| 5 | 62907 | 7.9% |
| 1 | 31462 | 3.9% |
| 2 | 22080 | 2.8% |
| 9 | 20707 | 2.6% |
| 3 | 17061 | 2.1% |
| 4 | 11040 | 1.4% |
| 7 | 8028 | 1.0% |
| 6 | 7619 | 1.0% |
Latin
| Value | Count | Frequency (%) |
| t | 1 | |
| r | 1 | |
| u | 1 | |
| e | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 800112 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 372794 | |
| . | 241051 | |
| 5 | 62907 | 7.9% |
| 1 | 31462 | 3.9% |
| 2 | 22080 | 2.8% |
| 9 | 20707 | 2.6% |
| 3 | 17061 | 2.1% |
| 4 | 11040 | 1.4% |
| 7 | 8028 | 1.0% |
| 6 | 7619 | 1.0% |
| Other values (5) | 5363 | 0.7% |
distanceFromCentroidInMeters
Text
Missing 
| Distinct | 910 |
|---|---|
| Distinct (%) | 19.6% |
| Missing | 2356831 |
| Missing (%) | 99.8% |
| Memory size | 18.0 MiB |
Length
| Max length | 18 |
|---|---|
| Median length | 17 |
| Mean length | 14.57712193 |
| Min length | 3 |
Unique
| Unique | 466 |
|---|---|
| Unique (%) | 10.0% |
Sample
| 1st row | 365.13018771678105 |
|---|---|
| 2nd row | 0.0 |
| 3rd row | 3.650579245692265 |
| 4th row | 0.0 |
| 5th row | 0.0 |
| Value | Count | Frequency (%) |
| 0.0 | 906 | 19.5% |
| 511.15289545417056 | 224 | 4.8% |
| 4105.643932903784 | 143 | 3.1% |
| 365.9456782615661 | 97 | 2.1% |
| 2063.191632254214 | 87 | 1.9% |
| 4961.494346970892 | 60 | 1.3% |
| 2015.7207067821585 | 54 | 1.2% |
| 1436.265124532336 | 53 | 1.1% |
| 949.7490617483568 | 46 | 1.0% |
| 3997.886559051776 | 41 | 0.9% |
| Other values (900) | 2931 |
Most occurring characters
| Value | Count | Frequency (%) |
| 4 | 7071 | |
| 0 | 6998 | |
| 5 | 6883 | |
| 1 | 6804 | |
| 2 | 6313 | |
| 3 | 6228 | |
| 6 | 5979 | |
| 8 | 5873 | |
| 9 | 5775 | |
| 7 | 5102 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 63026 | |
| Other Punctuation | 4641 | 6.9% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 4 | 7071 | |
| 0 | 6998 | |
| 5 | 6883 | |
| 1 | 6804 | |
| 2 | 6313 | |
| 3 | 6228 | |
| 6 | 5979 | |
| 8 | 5873 | |
| 9 | 5775 | |
| 7 | 5102 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 4641 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 67667 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 4 | 7071 | |
| 0 | 6998 | |
| 5 | 6883 | |
| 1 | 6804 | |
| 2 | 6313 | |
| 3 | 6228 | |
| 6 | 5979 | |
| 8 | 5873 | |
| 9 | 5775 | |
| 7 | 5102 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 67667 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 4 | 7071 | |
| 0 | 6998 | |
| 5 | 6883 | |
| 1 | 6804 | |
| 2 | 6313 | |
| 3 | 6228 | |
| 6 | 5979 | |
| 8 | 5873 | |
| 9 | 5775 | |
| 7 | 5102 |
issue
Text
| Distinct | 543 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 858 |
| Missing (%) | < 0.1% |
| Memory size | 18.0 MiB |
Length
| Max length | 210 |
|---|---|
| Median length | 48 |
| Mean length | 67.70638584 |
| Min length | 7 |
Unique
| Unique | 128 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | OCCURRENCE_STATUS_INFERRED_FROM_INDIVIDUAL_COUNT;GEODETIC_DATUM_ASSUMED_WGS84;CONTINENT_DERIVED_FROM_COORDINATES;CONTINENT_INVALID |
|---|---|
| 2nd row | OCCURRENCE_STATUS_INFERRED_FROM_INDIVIDUAL_COUNT |
| 3rd row | OCCURRENCE_STATUS_INFERRED_FROM_INDIVIDUAL_COUNT;GEODETIC_DATUM_ASSUMED_WGS84 |
| 4th row | OCCURRENCE_STATUS_INFERRED_FROM_INDIVIDUAL_COUNT |
| 5th row | OCCURRENCE_STATUS_INFERRED_FROM_INDIVIDUAL_COUNT;GEODETIC_DATUM_ASSUMED_WGS84;CONTINENT_INVALID |
| Value | Count | Frequency (%) |
| occurrence_status_inferred_from_individual_count | 1322725 | |
| occurrence_status_inferred_from_individual_count;geodetic_datum_assumed_wgs84 | 251044 | 10.6% |
| occurrence_status_inferred_from_individual_count;geodetic_datum_assumed_wgs84;continent_invalid | 152222 | 6.4% |
| occurrence_status_inferred_from_individual_count;continent_derived_from_country;continent_invalid | 105605 | 4.5% |
| occurrence_status_inferred_from_individual_count;geodetic_datum_assumed_wgs84;continent_derived_from_coordinates;continent_invalid | 86812 | 3.7% |
| occurrence_status_inferred_from_individual_count;geodetic_datum_assumed_wgs84;continent_derived_from_coordinates | 76729 | 3.3% |
| occurrence_status_inferred_from_individual_count;taxon_match_higherrank | 71534 | 3.0% |
| occurrence_status_inferred_from_individual_count;continent_derived_from_country | 67959 | 2.9% |
| occurrence_status_inferred_from_individual_count;taxon_match_fuzzy | 25595 | 1.1% |
| occurrence_status_inferred_from_individual_count;recorded_date_mismatch | 25313 | 1.1% |
| Other values (533) | 175077 | 7.4% |
Most occurring characters
| Value | Count | Frequency (%) |
| _ | 15993110 | |
| E | 13659218 | 8.5% |
| R | 13467823 | 8.4% |
| N | 13134550 | 8.2% |
| I | 12670207 | 7.9% |
| C | 11725720 | 7.3% |
| U | 11078299 | 6.9% |
| T | 11054854 | 6.9% |
| D | 10756285 | 6.7% |
| O | 9975135 | 6.2% |
| Other values (34) | 36313509 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 140757398 | |
| Connector Punctuation | 15993110 | 10.0% |
| Other Punctuation | 1760707 | 1.1% |
| Decimal Number | 1317486 | 0.8% |
| Lowercase Letter | 9 | < 0.1% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| E | 13659218 | |
| R | 13467823 | |
| N | 13134550 | |
| I | 12670207 | |
| C | 11725720 | |
| U | 11078299 | |
| T | 11054854 | |
| D | 10756285 | |
| O | 9975135 | 7.1% |
| A | 7311251 | 5.2% |
| Other values (15) | 25924056 |
Decimal Number
| Value | Count | Frequency (%) |
| 8 | 658738 | |
| 4 | 658738 | |
| 5 | 2 | < 0.1% |
| 3 | 2 | < 0.1% |
| 6 | 1 | < 0.1% |
| 1 | 1 | < 0.1% |
| 7 | 1 | < 0.1% |
| 9 | 1 | < 0.1% |
| 2 | 1 | < 0.1% |
| 0 | 1 | < 0.1% |
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 3 | |
| i | 1 | 11.1% |
| n | 1 | 11.1% |
| c | 1 | 11.1% |
| m | 1 | 11.1% |
| r | 1 | 11.1% |
| e | 1 | 11.1% |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 15993110 |
Other Punctuation
| Value | Count | Frequency (%) |
| ; | 1760707 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 140757407 | |
| Common | 19071303 | 11.9% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| E | 13659218 | |
| R | 13467823 | |
| N | 13134550 | |
| I | 12670207 | |
| C | 11725720 | |
| U | 11078299 | |
| T | 11054854 | |
| D | 10756285 | |
| O | 9975135 | 7.1% |
| A | 7311251 | 5.2% |
| Other values (22) | 25924065 |
Common
| Value | Count | Frequency (%) |
| _ | 15993110 | |
| ; | 1760707 | 9.2% |
| 8 | 658738 | 3.5% |
| 4 | 658738 | 3.5% |
| 5 | 2 | < 0.1% |
| 3 | 2 | < 0.1% |
| 6 | 1 | < 0.1% |
| 1 | 1 | < 0.1% |
| 7 | 1 | < 0.1% |
| 9 | 1 | < 0.1% |
| Other values (2) | 2 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 159828710 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| _ | 15993110 | |
| E | 13659218 | 8.5% |
| R | 13467823 | 8.4% |
| N | 13134550 | 8.2% |
| I | 12670207 | 7.9% |
| C | 11725720 | 7.3% |
| U | 11078299 | 6.9% |
| T | 11054854 | 6.9% |
| D | 10756285 | 6.7% |
| O | 9975135 | 6.2% |
| Other values (34) | 36313509 |
mediaType
Text
Missing 
| Distinct | 59 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 863248 |
| Missing (%) | 36.6% |
| Memory size | 18.0 MiB |
Length
| Max length | 1011 |
|---|---|
| Median length | 10 |
| Mean length | 11.32266182 |
| Min length | 5 |
Unique
| Unique | 9 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | StillImage |
|---|---|
| 2nd row | StillImage |
| 3rd row | StillImage |
| 4th row | StillImage |
| 5th row | StillImage |
| Value | Count | Frequency (%) |
| stillimage | 1393845 | |
| stillimage;stillimage | 79719 | 5.3% |
| stillimage;stillimage;stillimage | 8722 | 0.6% |
| stillimage;stillimage;stillimage;stillimage | 7143 | 0.5% |
| stillimage;stillimage;stillimage;stillimage;stillimage | 2786 | 0.2% |
| stillimage;stillimage;stillimage;stillimage;stillimage;stillimage | 2282 | 0.2% |
| stillimage;stillimage;stillimage;stillimage;stillimage;stillimage;stillimage | 958 | 0.1% |
| stillimage;stillimage;stillimage;stillimage;stillimage;stillimage;stillimage;stillimage | 570 | < 0.1% |
| stillimage;stillimage;stillimage;stillimage;stillimage;stillimage;stillimage;stillimage;stillimage;stillimage | 531 | < 0.1% |
| stillimage;stillimage;stillimage;stillimage;stillimage;stillimage;stillimage;stillimage;stillimage | 332 | < 0.1% |
| Other values (49) | 1337 | 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| l | 3356749 | |
| a | 1678375 | |
| e | 1678375 | |
| S | 1678374 | |
| t | 1678374 | |
| i | 1678374 | |
| I | 1678374 | |
| m | 1678374 | |
| g | 1678374 | |
| ; | 180150 | 1.1% |
| Other values (2) | 2 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 13426997 | |
| Uppercase Letter | 3356748 | 19.8% |
| Other Punctuation | 180150 | 1.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| l | 3356749 | |
| a | 1678375 | |
| e | 1678375 | |
| t | 1678374 | |
| i | 1678374 | |
| m | 1678374 | |
| g | 1678374 | |
| f | 1 | < 0.1% |
| s | 1 | < 0.1% |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 1678374 | |
| I | 1678374 |
Other Punctuation
| Value | Count | Frequency (%) |
| ; | 180150 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 16783745 | |
| Common | 180150 | 1.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| l | 3356749 | |
| a | 1678375 | |
| e | 1678375 | |
| S | 1678374 | |
| t | 1678374 | |
| i | 1678374 | |
| I | 1678374 | |
| m | 1678374 | |
| g | 1678374 | |
| f | 1 | < 0.1% |
Common
| Value | Count | Frequency (%) |
| ; | 180150 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 16963895 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| l | 3356749 | |
| a | 1678375 | |
| e | 1678375 | |
| S | 1678374 | |
| t | 1678374 | |
| i | 1678374 | |
| I | 1678374 | |
| m | 1678374 | |
| g | 1678374 | |
| ; | 180150 | 1.1% |
| Other values (2) | 2 | < 0.1% |
hasCoordinate
Text
| Distinct | 10 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 5 |
| Missing (%) | < 0.1% |
| Memory size | 18.0 MiB |
Length
| Max length | 48 |
|---|---|
| Median length | 5 |
| Mean length | 4.698685309 |
| Min length | 4 |
Unique
| Unique | 8 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | true |
|---|---|
| 2nd row | false |
| 3rd row | true |
| 4th row | false |
| 5th row | true |
| Value | Count | Frequency (%) |
| false | 1649753 | |
| true | 711707 | |
| 1914 | 1 | < 0.1% |
| mitchill | 1 | < 0.1% |
| bilinearis | 1 | < 0.1% |
| merluccius | 1 | < 0.1% |
| greene | 1 | < 0.1% |
| blumeri | 1 | < 0.1% |
| senecio | 1 | < 0.1% |
| 1900 | 1 | < 0.1% |
| Other values (17) | 17 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 2361474 | |
| l | 1649761 | |
| s | 1649759 | |
| a | 1649759 | |
| f | 1649753 | |
| r | 711716 | 6.4% |
| u | 711716 | 6.4% |
| t | 711714 | 6.4% |
| (space) | 17 | < 0.1% |
| i | 16 | < 0.1% |
| Other values (47) | 110 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 11095715 | |
| Decimal Number | 27 | < 0.1% |
| Uppercase Letter | 25 | < 0.1% |
| Space Separator | 17 | < 0.1% |
| Other Punctuation | 6 | < 0.1% |
| Open Punctuation | 2 | < 0.1% |
| Close Punctuation | 2 | < 0.1% |
| Connector Punctuation | 1 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 2361474 | |
| l | 1649761 | |
| s | 1649759 | |
| a | 1649759 | |
| f | 1649753 | |
| r | 711716 | 6.4% |
| u | 711716 | 6.4% |
| t | 711714 | 6.4% |
| i | 16 | < 0.1% |
| o | 8 | < 0.1% |
| Other values (13) | 39 | < 0.1% |
Uppercase Letter
| Value | Count | Frequency (%) |
| H | 3 | 12.0% |
| M | 3 | 12.0% |
| A | 2 | 8.0% |
| R | 2 | 8.0% |
| L | 1 | 4.0% |
| D | 1 | 4.0% |
| S | 1 | 4.0% |
| Z | 1 | 4.0% |
| O | 1 | 4.0% |
| P | 1 | 4.0% |
| Other values (9) | 9 |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 8 | |
| 9 | 4 | |
| 8 | 3 | 11.1% |
| 7 | 3 | 11.1% |
| 0 | 3 | 11.1% |
| 4 | 2 | 7.4% |
| 5 | 2 | 7.4% |
| 3 | 1 | 3.7% |
| 2 | 1 | 3.7% |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 5 | |
| & | 1 | 16.7% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 17 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 2 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 2 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 11095740 | |
| Common | 55 | < 0.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 2361474 | |
| l | 1649761 | |
| s | 1649759 | |
| a | 1649759 | |
| f | 1649753 | |
| r | 711716 | 6.4% |
| u | 711716 | 6.4% |
| t | 711714 | 6.4% |
| i | 16 | < 0.1% |
| o | 8 | < 0.1% |
| Other values (32) | 64 | < 0.1% |
Common
| Value | Count | Frequency (%) |
| (space) | 17 | |
| 1 | 8 | |
| , | 5 | 9.1% |
| 9 | 4 | 7.3% |
| 8 | 3 | 5.5% |
| 7 | 3 | 5.5% |
| 0 | 3 | 5.5% |
| ( | 2 | 3.6% |
| 4 | 2 | 3.6% |
| ) | 2 | 3.6% |
| Other values (5) | 6 | 10.9% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 11095794 | |
| None | 1 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 2361474 | |
| l | 1649761 | |
| s | 1649759 | |
| a | 1649759 | |
| f | 1649753 | |
| r | 711716 | 6.4% |
| u | 711716 | 6.4% |
| t | 711714 | 6.4% |
| (space) | 17 | < 0.1% |
| i | 16 | < 0.1% |
| Other values (46) | 109 | < 0.1% |
None
| Value | Count | Frequency (%) |
| ö | 1 |
| Distinct | 5 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 10 |
| Missing (%) | < 0.1% |
| Memory size | 18.0 MiB |
Length
| Max length | 18 |
|---|---|
| Median length | 5 |
| Mean length | 4.993394349 |
| Min length | 4 |
Unique
| Unique | 3 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | false |
|---|---|
| 2nd row | false |
| 3rd row | false |
| 4th row | false |
| 5th row | false |
| Value | Count | Frequency (%) |
| false | 2345838 | |
| true | 15622 | 0.7% |
| north_america | 1 | < 0.1% |
| guatteria | 1 | < 0.1% |
| punctata | 1 | < 0.1% |
| species | 1 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 2361461 | |
| a | 2345842 | |
| f | 2345838 | |
| l | 2345838 | |
| s | 2345838 | |
| t | 15626 | 0.1% |
| u | 15624 | 0.1% |
| r | 15623 | 0.1% |
| E | 3 | < 0.1% |
| I | 2 | < 0.1% |
| Other values (17) | 21 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 11791694 | |
| Uppercase Letter | 20 | < 0.1% |
| Space Separator | 1 | < 0.1% |
| Connector Punctuation | 1 | < 0.1% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| E | 3 | |
| I | 2 | |
| A | 2 | |
| C | 2 | |
| R | 2 | |
| S | 2 | |
| G | 1 | 5.0% |
| M | 1 | 5.0% |
| H | 1 | 5.0% |
| T | 1 | 5.0% |
| Other values (3) | 3 |
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 2361461 | |
| a | 2345842 | |
| f | 2345838 | |
| l | 2345838 | |
| s | 2345838 | |
| t | 15626 | 0.1% |
| u | 15624 | 0.1% |
| r | 15623 | 0.1% |
| p | 1 | < 0.1% |
| n | 1 | < 0.1% |
| Other values (2) | 2 | < 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 1 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 11791714 | |
| Common | 2 | < 0.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 2361461 | |
| a | 2345842 | |
| f | 2345838 | |
| l | 2345838 | |
| s | 2345838 | |
| t | 15626 | 0.1% |
| u | 15624 | 0.1% |
| r | 15623 | 0.1% |
| E | 3 | < 0.1% |
| I | 2 | < 0.1% |
| Other values (15) | 19 | < 0.1% |
Common
| Value | Count | Frequency (%) |
| (space) | 1 | |
| _ | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 11791716 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 2361461 | |
| a | 2345842 | |
| f | 2345838 | |
| l | 2345838 | |
| s | 2345838 | |
| t | 15626 | 0.1% |
| u | 15624 | 0.1% |
| r | 15623 | 0.1% |
| E | 3 | < 0.1% |
| I | 2 | < 0.1% |
| Other values (17) | 21 | < 0.1% |
taxonKey
Text
| Distinct | 362006 |
|---|---|
| Distinct (%) | 15.3% |
| Missing | 12 |
| Missing (%) | < 0.1% |
| Memory size | 18.0 MiB |
Length
| Max length | 37 |
|---|---|
| Median length | 7 |
| Mean length | 6.857343822 |
| Min length | 1 |
Unique
| Unique | 179314 |
|---|---|
| Unique (%) | 7.6% |
Sample
| 1st row | 3869 |
|---|---|
| 2nd row | 5374585 |
| 3rd row | 2431199 |
| 4th row | 714 |
| 5th row | 2322812 |
| Value | Count | Frequency (%) |
| 2431491 | 19390 | 0.8% |
| 225 | 6083 | 0.3% |
| 0 | 5762 | 0.2% |
| 8176985 | 4732 | 0.2% |
| 5967481 | 3865 | 0.2% |
| 2437967 | 3463 | 0.1% |
| 2431539 | 3260 | 0.1% |
| 2440447 | 2983 | 0.1% |
| 105 | 2810 | 0.1% |
| 1340278 | 2739 | 0.1% |
| Other values (361999) | 2306377 |
Most occurring characters
| Value | Count | Frequency (%) |
| 2 | 2449798 | |
| 3 | 1788146 | |
| 4 | 1653847 | |
| 1 | 1611024 | |
| 5 | 1582086 | |
| 7 | 1533979 | |
| 6 | 1419285 | |
| 8 | 1406069 | |
| 9 | 1386672 | |
| 0 | 1362407 | |
| Other values (22) | 37 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 16193313 | |
| Lowercase Letter | 24 | < 0.1% |
| Uppercase Letter | 5 | < 0.1% |
| Other Punctuation | 3 | < 0.1% |
| Space Separator | 3 | < 0.1% |
| Close Punctuation | 1 | < 0.1% |
| Open Punctuation | 1 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 5 | |
| t | 4 | |
| u | 3 | |
| r | 2 | 8.3% |
| o | 1 | 4.2% |
| l | 1 | 4.2% |
| w | 1 | 4.2% |
| i | 1 | 4.2% |
| b | 1 | 4.2% |
| c | 1 | 4.2% |
| Other values (4) | 4 |
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 2449798 | |
| 3 | 1788146 | |
| 4 | 1653847 | |
| 1 | 1611024 | |
| 5 | 1582086 | |
| 7 | 1533979 | |
| 6 | 1419285 | |
| 8 | 1406069 | |
| 9 | 1386672 | |
| 0 | 1362407 |
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 2 | |
| H | 1 | |
| R | 1 | |
| G | 1 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 3 |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 3 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 1 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 16193321 | |
| Latin | 29 | < 0.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 5 | |
| t | 4 | |
| u | 3 | 10.3% |
| r | 2 | 6.9% |
| A | 2 | 6.9% |
| o | 1 | 3.4% |
| l | 1 | 3.4% |
| H | 1 | 3.4% |
| R | 1 | 3.4% |
| w | 1 | 3.4% |
| Other values (8) | 8 |
Common
| Value | Count | Frequency (%) |
| 2 | 2449798 | |
| 3 | 1788146 | |
| 4 | 1653847 | |
| 1 | 1611024 | |
| 5 | 1582086 | |
| 7 | 1533979 | |
| 6 | 1419285 | |
| 8 | 1406069 | |
| 9 | 1386672 | |
| 0 | 1362407 | |
| Other values (4) | 8 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 16193350 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 2 | 2449798 | |
| 3 | 1788146 | |
| 4 | 1653847 | |
| 1 | 1611024 | |
| 5 | 1582086 | |
| 7 | 1533979 | |
| 6 | 1419285 | |
| 8 | 1406069 | |
| 9 | 1386672 | |
| 0 | 1362407 | |
| Other values (22) | 37 | < 0.1% |
acceptedTaxonKey
Text
| Distinct | 315017 |
|---|---|
| Distinct (%) | 13.4% |
| Missing | 5774 |
| Missing (%) | 0.2% |
| Memory size | 18.0 MiB |
Length
| Max length | 18 |
|---|---|
| Median length | 7 |
| Mean length | 6.879705769 |
| Min length | 1 |
Unique
| Unique | 142213 |
|---|---|
| Unique (%) | 6.0% |
Sample
| 1st row | 3869 |
|---|---|
| 2nd row | 3044413 |
| 3rd row | 2431199 |
| 4th row | 714 |
| 5th row | 2322812 |
| Value | Count | Frequency (%) |
| 2431491 | 19390 | 0.8% |
| 225 | 6083 | 0.3% |
| 7947184 | 4743 | 0.2% |
| 5967481 | 3865 | 0.2% |
| 2437967 | 3815 | 0.2% |
| 2431539 | 3260 | 0.1% |
| 2440447 | 2987 | 0.1% |
| 105 | 2810 | 0.1% |
| 1340278 | 2739 | 0.1% |
| 2431224 | 2562 | 0.1% |
| Other values (315008) | 2303446 |
Most occurring characters
| Value | Count | Frequency (%) |
| 2 | 2439480 | |
| 3 | 1778291 | |
| 4 | 1658961 | |
| 1 | 1632514 | |
| 5 | 1571567 | |
| 7 | 1538551 | |
| 8 | 1409644 | |
| 6 | 1402728 | |
| 9 | 1400354 | |
| 0 | 1374408 | |
| Other values (11) | 18 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 16206498 | |
| Lowercase Letter | 16 | < 0.1% |
| Space Separator | 1 | < 0.1% |
| Uppercase Letter | 1 | < 0.1% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 2439480 | |
| 3 | 1778291 | |
| 4 | 1658961 | |
| 1 | 1632514 | |
| 5 | 1571567 | |
| 7 | 1538551 | |
| 8 | 1409644 | |
| 6 | 1402728 | |
| 9 | 1400354 | |
| 0 | 1374408 |
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 4 | |
| t | 4 | |
| u | 2 | |
| n | 1 | 6.2% |
| p | 1 | 6.2% |
| i | 1 | 6.2% |
| r | 1 | 6.2% |
| e | 1 | 6.2% |
| c | 1 | 6.2% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 1 |
Uppercase Letter
| Value | Count | Frequency (%) |
| G | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 16206499 | |
| Latin | 17 | < 0.1% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 2 | 2439480 | |
| 3 | 1778291 | |
| 4 | 1658961 | |
| 1 | 1632514 | |
| 5 | 1571567 | |
| 7 | 1538551 | |
| 8 | 1409644 | |
| 6 | 1402728 | |
| 9 | 1400354 | |
| 0 | 1374408 |
Latin
| Value | Count | Frequency (%) |
| a | 4 | |
| t | 4 | |
| u | 2 | |
| n | 1 | 5.9% |
| p | 1 | 5.9% |
| G | 1 | 5.9% |
| i | 1 | 5.9% |
| r | 1 | 5.9% |
| e | 1 | 5.9% |
| c | 1 | 5.9% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 16206516 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 2 | 2439480 | |
| 3 | 1778291 | |
| 4 | 1658961 | |
| 1 | 1632514 | |
| 5 | 1571567 | |
| 7 | 1538551 | |
| 8 | 1409644 | |
| 6 | 1402728 | |
| 9 | 1400354 | |
| 0 | 1374408 | |
| Other values (11) | 18 | < 0.1% |
kingdomKey
Text
| Distinct | 8 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 12 |
| Missing (%) | < 0.1% |
| Memory size | 18.0 MiB |
Length
| Max length | 24 |
|---|---|
| Median length | 1 |
| Mean length | 1.00000974 |
| Min length | 1 |
Unique
| Unique | 1 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | 1 |
|---|---|
| 2nd row | 6 |
| 3rd row | 1 |
| 4th row | 1 |
| 5th row | 1 |
| Value | Count | Frequency (%) |
| 1 | 1209386 | |
| 6 | 1054744 | |
| 5 | 56807 | 2.4% |
| 4 | 20874 | 0.9% |
| 3 | 13612 | 0.6% |
| 0 | 5762 | 0.2% |
| 7 | 275 | < 0.1% |
| sphaeralcea | 1 | < 0.1% |
| palmeri | 1 | < 0.1% |
| rose | 1 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 1209386 | |
| 6 | 1054744 | |
| 5 | 56807 | 2.4% |
| 4 | 20874 | 0.9% |
| 3 | 13612 | 0.6% |
| 0 | 5762 | 0.2% |
| 7 | 275 | < 0.1% |
| e | 4 | < 0.1% |
| a | 4 | < 0.1% |
| p | 2 | < 0.1% |
| Other values (11) | 14 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 2361460 | |
| Lowercase Letter | 20 | < 0.1% |
| Space Separator | 2 | < 0.1% |
| Uppercase Letter | 2 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 4 | |
| a | 4 | |
| p | 2 | |
| r | 2 | |
| l | 2 | |
| h | 1 | 5.0% |
| c | 1 | 5.0% |
| m | 1 | 5.0% |
| i | 1 | 5.0% |
| o | 1 | 5.0% |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 1209386 | |
| 6 | 1054744 | |
| 5 | 56807 | 2.4% |
| 4 | 20874 | 0.9% |
| 3 | 13612 | 0.6% |
| 0 | 5762 | 0.2% |
| 7 | 275 | < 0.1% |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 1 | |
| R | 1 |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 2 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 2361462 | |
| Latin | 22 | < 0.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 4 | |
| a | 4 | |
| p | 2 | |
| r | 2 | |
| l | 2 | |
| h | 1 | 4.5% |
| S | 1 | 4.5% |
| c | 1 | 4.5% |
| m | 1 | 4.5% |
| i | 1 | 4.5% |
| Other values (3) | 3 |
Common
| Value | Count | Frequency (%) |
| 1 | 1209386 | |
| 6 | 1054744 | |
| 5 | 56807 | 2.4% |
| 4 | 20874 | 0.9% |
| 3 | 13612 | 0.6% |
| 0 | 5762 | 0.2% |
| 7 | 275 | < 0.1% |
| (space) | 2 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2361484 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 1209386 | |
| 6 | 1054744 | |
| 5 | 56807 | 2.4% |
| 4 | 20874 | 0.9% |
| 3 | 13612 | 0.6% |
| 0 | 5762 | 0.2% |
| 7 | 275 | < 0.1% |
| e | 4 | < 0.1% |
| a | 4 | < 0.1% |
| p | 2 | < 0.1% |
| Other values (11) | 14 | < 0.1% |
phylumKey
Text
| Distinct | 63 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 7897 |
| Missing (%) | 0.3% |
| Memory size | 18.0 MiB |
Length
| Max length | 8 |
|---|---|
| Median length | 2 |
| Mean length | 4.119775185 |
| Min length | 1 |
Unique
| Unique | 6 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | 54 |
|---|---|
| 2nd row | 7707728 |
| 3rd row | 44 |
| 4th row | 43 |
| 5th row | 42 |
| Value | Count | Frequency (%) |
| 7707728 | 965311 | |
| 44 | 572771 | |
| 54 | 252406 | 10.7% |
| 52 | 220179 | 9.4% |
| 42 | 61416 | 2.6% |
| 95 | 56083 | 2.4% |
| 35 | 37922 | 1.6% |
| 106 | 30954 | 1.3% |
| 43 | 29998 | 1.3% |
| 50 | 23220 | 1.0% |
| Other values (53) | 103316 | 4.4% |
Most occurring characters
| Value | Count | Frequency (%) |
| 7 | 3890684 | |
| 4 | 1510363 | 15.6% |
| 2 | 1249598 | 12.9% |
| 0 | 1044134 | 10.8% |
| 8 | 1032331 | 10.6% |
| 5 | 623579 | 6.4% |
| 9 | 110301 | 1.1% |
| 3 | 83881 | 0.9% |
| 6 | 78759 | 0.8% |
| 1 | 72563 | 0.7% |
| Other values (8) | 11 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 9696193 | |
| Uppercase Letter | 11 | < 0.1% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 7 | 3890684 | |
| 4 | 1510363 | 15.6% |
| 2 | 1249598 | 12.9% |
| 0 | 1044134 | 10.8% |
| 8 | 1032331 | 10.6% |
| 5 | 623579 | 6.4% |
| 9 | 110301 | 1.1% |
| 3 | 83881 | 0.9% |
| 6 | 78759 | 0.8% |
| 1 | 72563 | 0.7% |
Uppercase Letter
| Value | Count | Frequency (%) |
| E | 3 | |
| C | 2 | |
| M | 1 | 9.1% |
| L | 1 | 9.1% |
| A | 1 | 9.1% |
| P | 1 | 9.1% |
| T | 1 | 9.1% |
| D | 1 | 9.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 9696193 | |
| Latin | 11 | < 0.1% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 7 | 3890684 | |
| 4 | 1510363 | 15.6% |
| 2 | 1249598 | 12.9% |
| 0 | 1044134 | 10.8% |
| 8 | 1032331 | 10.6% |
| 5 | 623579 | 6.4% |
| 9 | 110301 | 1.1% |
| 3 | 83881 | 0.9% |
| 6 | 78759 | 0.8% |
| 1 | 72563 | 0.7% |
Latin
| Value | Count | Frequency (%) |
| E | 3 | |
| C | 2 | |
| M | 1 | 9.1% |
| L | 1 | 9.1% |
| A | 1 | 9.1% |
| P | 1 | 9.1% |
| T | 1 | 9.1% |
| D | 1 | 9.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 9696204 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 7 | 3890684 | |
| 4 | 1510363 | 15.6% |
| 2 | 1249598 | 12.9% |
| 0 | 1044134 | 10.8% |
| 8 | 1032331 | 10.6% |
| 5 | 623579 | 6.4% |
| 9 | 110301 | 1.1% |
| 3 | 83881 | 0.9% |
| 6 | 78759 | 0.8% |
| 1 | 72563 | 0.7% |
| Other values (8) | 11 | < 0.1% |
classKey
Text
Missing 
| Distinct | 185 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 138564 |
| Missing (%) | 5.9% |
| Memory size | 18.0 MiB |
Length
| Max length | 24 |
|---|---|
| Median length | 3 |
| Mean length | 3.356916995 |
| Min length | 3 |
Unique
| Unique | 19 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | 229 |
|---|---|
| 2nd row | 220 |
| 3rd row | 131 |
| 4th row | 206 |
| 5th row | 256 |
| Value | Count | Frequency (%) |
| 220 | 657370 | |
| 196 | 231154 | 10.4% |
| 225 | 155259 | 7.0% |
| 359 | 152953 | 6.9% |
| 216 | 149742 | 6.7% |
| 212 | 149231 | 6.7% |
| 131 | 100689 | 4.5% |
| 229 | 76525 | 3.4% |
| 7228684 | 63916 | 2.9% |
| 256 | 53619 | 2.4% |
| Other values (175) | 432451 |
Most occurring characters
| Value | Count | Frequency (%) |
| 2 | 2662639 | |
| 1 | 1116752 | |
| 0 | 779965 | 10.5% |
| 5 | 577305 | 7.7% |
| 6 | 574737 | 7.7% |
| 9 | 561562 | 7.5% |
| 3 | 542610 | 7.3% |
| 7 | 243304 | 3.3% |
| 8 | 204616 | 2.7% |
| 4 | 198624 | 2.7% |
| Other values (5) | 7 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 7462114 | |
| Other Punctuation | 3 | < 0.1% |
| Dash Punctuation | 2 | < 0.1% |
| Uppercase Letter | 2 | < 0.1% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 2662639 | |
| 1 | 1116752 | |
| 0 | 779965 | 10.5% |
| 5 | 577305 | 7.7% |
| 6 | 574737 | 7.7% |
| 9 | 561562 | 7.5% |
| 3 | 542610 | 7.3% |
| 7 | 243304 | 3.3% |
| 8 | 204616 | 2.7% |
| 4 | 198624 | 2.7% |
Other Punctuation
| Value | Count | Frequency (%) |
| : | 2 | |
| . | 1 |
Uppercase Letter
| Value | Count | Frequency (%) |
| T | 1 | |
| Z | 1 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 2 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 7462119 | |
| Latin | 2 | < 0.1% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 2 | 2662639 | |
| 1 | 1116752 | |
| 0 | 779965 | 10.5% |
| 5 | 577305 | 7.7% |
| 6 | 574737 | 7.7% |
| 9 | 561562 | 7.5% |
| 3 | 542610 | 7.3% |
| 7 | 243304 | 3.3% |
| 8 | 204616 | 2.7% |
| 4 | 198624 | 2.7% |
| Other values (3) | 5 | < 0.1% |
Latin
| Value | Count | Frequency (%) |
| T | 1 | |
| Z | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 7462121 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 2 | 2662639 | |
| 1 | 1116752 | |
| 0 | 779965 | 10.5% |
| 5 | 577305 | 7.7% |
| 6 | 574737 | 7.7% |
| 9 | 561562 | 7.5% |
| 3 | 542610 | 7.3% |
| 7 | 243304 | 3.3% |
| 8 | 204616 | 2.7% |
| 4 | 198624 | 2.7% |
| Other values (5) | 7 | < 0.1% |
orderKey
Text
Missing 
| Distinct | 932 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 145723 |
| Missing (%) | 6.2% |
| Memory size | 18.0 MiB |
Length
| Max length | 133 |
|---|---|
| Median length | 3 |
| Mean length | 3.806610403 |
| Min length | 3 |
Unique
| Unique | 81 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | 637 |
|---|---|
| 2nd row | 7225535 |
| 3rd row | 953 |
| 4th row | 714 |
| 5th row | 865 |
| Value | Count | Frequency (%) |
| 1369 | 178531 | 8.1% |
| 414 | 96944 | 4.4% |
| 729 | 94751 | 4.3% |
| 1459 | 75757 | 3.4% |
| 408 | 67866 | 3.1% |
| 1370 | 64632 | 2.9% |
| 953 | 60565 | 2.7% |
| 587 | 54527 | 2.5% |
| 1414 | 53482 | 2.4% |
| 637 | 49962 | 2.3% |
| Other values (947) | 1418762 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 1389199 | |
| 9 | 1119561 | |
| 4 | 1065109 | |
| 3 | 921277 | |
| 7 | 869118 | |
| 2 | 694377 | |
| 5 | 649591 | |
| 6 | 629528 | |
| 0 | 603865 | |
| 8 | 492425 | 5.8% |
| Other values (40) | 447 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 8434050 | |
| Lowercase Letter | 347 | < 0.1% |
| Uppercase Letter | 37 | < 0.1% |
| Other Punctuation | 32 | < 0.1% |
| Space Separator | 29 | < 0.1% |
| Dash Punctuation | 2 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 50 | |
| e | 42 | |
| i | 37 | |
| t | 29 | |
| o | 28 | 8.1% |
| l | 21 | 6.1% |
| r | 21 | 6.1% |
| d | 19 | 5.5% |
| s | 15 | 4.3% |
| n | 15 | 4.3% |
| Other values (11) | 70 |
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 11 | |
| P | 7 | |
| C | 3 | 8.1% |
| T | 3 | 8.1% |
| M | 2 | 5.4% |
| N | 2 | 5.4% |
| D | 2 | 5.4% |
| G | 1 | 2.7% |
| O | 1 | 2.7% |
| H | 1 | 2.7% |
| Other values (4) | 4 | 10.8% |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 1389199 | |
| 9 | 1119561 | |
| 4 | 1065109 | |
| 3 | 921277 | |
| 7 | 869118 | |
| 2 | 694377 | |
| 5 | 649591 | |
| 6 | 629528 | |
| 0 | 603865 | |
| 8 | 492425 | 5.8% |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 29 | |
| : | 2 | 6.2% |
| . | 1 | 3.1% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 29 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 2 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 8434113 | |
| Latin | 384 | < 0.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 50 | |
| e | 42 | 10.9% |
| i | 37 | 9.6% |
| t | 29 | 7.6% |
| o | 28 | 7.3% |
| l | 21 | 5.5% |
| r | 21 | 5.5% |
| d | 19 | 4.9% |
| s | 15 | 3.9% |
| n | 15 | 3.9% |
| Other values (25) | 107 |
Common
| Value | Count | Frequency (%) |
| 1 | 1389199 | |
| 9 | 1119561 | |
| 4 | 1065109 | |
| 3 | 921277 | |
| 7 | 869118 | |
| 2 | 694377 | |
| 5 | 649591 | |
| 6 | 629528 | |
| 0 | 603865 | |
| 8 | 492425 | 5.8% |
| Other values (5) | 63 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 8434497 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 1389199 | |
| 9 | 1119561 | |
| 4 | 1065109 | |
| 3 | 921277 | |
| 7 | 869118 | |
| 2 | 694377 | |
| 5 | 649591 | |
| 6 | 629528 | |
| 0 | 603865 | |
| 8 | 492425 | 5.8% |
| Other values (40) | 447 | < 0.1% |
familyKey
Text
Missing 
| Distinct | 6628 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 52492 |
| Missing (%) | 2.2% |
| Memory size | 18.0 MiB |
Length
| Max length | 36 |
|---|---|
| Median length | 4 |
| Mean length | 4.265161125 |
| Min length | 4 |
Unique
| Unique | 723 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | 3869 |
|---|---|
| 2nd row | 3112 |
| 3rd row | 6748 |
| 4th row | 2051 |
| 5th row | 4486 |
| Value | Count | Frequency (%) |
| 3073 | 128004 | 5.5% |
| 3065 | 91253 | 4.0% |
| 5386 | 60425 | 2.6% |
| 6748 | 56509 | 2.4% |
| 7708 | 35190 | 1.5% |
| 8798 | 30478 | 1.3% |
| 3240723 | 27411 | 1.2% |
| 5510 | 23714 | 1.0% |
| 4334 | 20894 | 0.9% |
| 6683 | 18664 | 0.8% |
| Other values (6618) | 1816439 |
Most occurring characters
| Value | Count | Frequency (%) |
| 6 | 1376781 | |
| 3 | 1350810 | |
| 7 | 1111863 | |
| 5 | 1030804 | |
| 4 | 948921 | |
| 2 | 926218 | |
| 8 | 924745 | |
| 0 | 815403 | |
| 9 | 744345 | |
| 1 | 618216 | |
| Other values (19) | 70 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 9848106 | |
| Lowercase Letter | 60 | < 0.1% |
| Uppercase Letter | 6 | < 0.1% |
| Dash Punctuation | 4 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 15 | |
| i | 9 | |
| n | 5 | 8.3% |
| m | 5 | 8.3% |
| l | 5 | 8.3% |
| c | 4 | 6.7% |
| t | 3 | 5.0% |
| e | 3 | 5.0% |
| b | 3 | 5.0% |
| d | 2 | 3.3% |
| Other values (5) | 6 | 10.0% |
Decimal Number
| Value | Count | Frequency (%) |
| 6 | 1376781 | |
| 3 | 1350810 | |
| 7 | 1111863 | |
| 5 | 1030804 | |
| 4 | 948921 | |
| 2 | 926218 | |
| 8 | 924745 | |
| 0 | 815403 | |
| 9 | 744345 | |
| 1 | 618216 |
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 4 | |
| P | 1 | 16.7% |
| C | 1 | 16.7% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 4 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 9848110 | |
| Latin | 66 | < 0.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 15 | |
| i | 9 | |
| n | 5 | 7.6% |
| m | 5 | 7.6% |
| l | 5 | 7.6% |
| c | 4 | 6.1% |
| A | 4 | 6.1% |
| t | 3 | 4.5% |
| e | 3 | 4.5% |
| b | 3 | 4.5% |
| Other values (8) | 10 |
Common
| Value | Count | Frequency (%) |
| 6 | 1376781 | |
| 3 | 1350810 | |
| 7 | 1111863 | |
| 5 | 1030804 | |
| 4 | 948921 | |
| 2 | 926218 | |
| 8 | 924745 | |
| 0 | 815403 | |
| 9 | 744345 | |
| 1 | 618216 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 9848176 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 6 | 1376781 | |
| 3 | 1350810 | |
| 7 | 1111863 | |
| 5 | 1030804 | |
| 4 | 948921 | |
| 2 | 926218 | |
| 8 | 924745 | |
| 0 | 815403 | |
| 9 | 744345 | |
| 1 | 618216 | |
| Other values (19) | 70 | < 0.1% |
genusKey
Text
Missing 
| Distinct | 59199 |
|---|---|
| Distinct (%) | 2.6% |
| Missing | 120649 |
| Missing (%) | 5.1% |
| Memory size | 18.0 MiB |
Length
| Max length | 15 |
|---|---|
| Median length | 7 |
| Mean length | 7.014589276 |
| Min length | 2 |
Unique
| Unique | 16745 |
|---|---|
| Unique (%) | 0.7% |
Sample
| 1st row | 3044392 |
|---|---|
| 2nd row | 2431198 |
| 3rd row | 2322781 |
| 4th row | 4798968 |
| 5th row | 4557352 |
| Value | Count | Frequency (%) |
| 2431477 | 42953 | 1.9% |
| 1340278 | 15824 | 0.7% |
| 2721893 | 14686 | 0.7% |
| 3188558 | 10093 | 0.5% |
| 2437961 | 10025 | 0.4% |
| 2431198 | 9258 | 0.4% |
| 2607519 | 7917 | 0.4% |
| 2704173 | 7658 | 0.3% |
| 2713455 | 7007 | 0.3% |
| 2705540 | 6575 | 0.3% |
| Other values (59189) | 2108828 |
Most occurring characters
| Value | Count | Frequency (%) |
| 2 | 2750236 | |
| 3 | 1857335 | |
| 4 | 1678697 | |
| 7 | 1472783 | |
| 1 | 1465143 | |
| 8 | 1402390 | |
| 9 | 1391416 | |
| 0 | 1293394 | |
| 6 | 1218552 | |
| 5 | 1188447 | |
| Other values (23) | 67 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 15718393 | |
| Lowercase Letter | 59 | < 0.1% |
| Uppercase Letter | 8 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 9 | |
| t | 7 | |
| h | 7 | |
| e | 6 | |
| o | 5 | |
| y | 4 | 6.8% |
| l | 4 | 6.8% |
| m | 3 | 5.1% |
| i | 2 | 3.4% |
| n | 2 | 3.4% |
| Other values (6) | 10 |
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 2750236 | |
| 3 | 1857335 | |
| 4 | 1678697 | |
| 7 | 1472783 | |
| 1 | 1465143 | |
| 8 | 1402390 | |
| 9 | 1391416 | |
| 0 | 1293394 | |
| 6 | 1218552 | |
| 5 | 1188447 |
Uppercase Letter
| Value | Count | Frequency (%) |
| P | 2 | |
| U | 1 | |
| M | 1 | |
| S | 1 | |
| C | 1 | |
| T | 1 | |
| N | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 15718393 | |
| Latin | 67 | < 0.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 9 | |
| t | 7 | 10.4% |
| h | 7 | 10.4% |
| e | 6 | 9.0% |
| o | 5 | 7.5% |
| y | 4 | 6.0% |
| l | 4 | 6.0% |
| m | 3 | 4.5% |
| P | 2 | 3.0% |
| i | 2 | 3.0% |
| Other values (13) | 18 |
Common
| Value | Count | Frequency (%) |
| 2 | 2750236 | |
| 3 | 1857335 | |
| 4 | 1678697 | |
| 7 | 1472783 | |
| 1 | 1465143 | |
| 8 | 1402390 | |
| 9 | 1391416 | |
| 0 | 1293394 | |
| 6 | 1218552 | |
| 5 | 1188447 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 15718460 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 2 | 2750236 | |
| 3 | 1857335 | |
| 4 | 1678697 | |
| 7 | 1472783 | |
| 1 | 1465143 | |
| 8 | 1402390 | |
| 9 | 1391416 | |
| 0 | 1293394 | |
| 6 | 1218552 | |
| 5 | 1188447 | |
| Other values (23) | 67 | < 0.1% |
subgenusKey
Text
Missing 
| Distinct | 7 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 2361466 |
| Missing (%) | > 99.9% |
| Memory size | 18.0 MiB |
Length
| Max length | 24 |
|---|---|
| Median length | 11 |
| Mean length | 11.14285714 |
| Min length | 2 |
Unique
| Unique | 7 |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | Chromadorea |
|---|---|
| 2nd row | Aconoidasida |
| 3rd row | NE |
| 4th row | Cestoda |
| 5th row | Trematoda |
| Value | Count | Frequency (%) |
| chromadorea | 1 | |
| aconoidasida | 1 | |
| ne | 1 | |
| cestoda | 1 | |
| trematoda | 1 | |
| magnoliopsida | 1 | |
| 2024-12-02t13:59:17.155z | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 9 | 11.5% |
| o | 8 | 10.3% |
| d | 6 | 7.7% |
| 1 | 4 | 5.1% |
| i | 4 | 5.1% |
| 2 | 4 | 5.1% |
| r | 3 | 3.8% |
| 5 | 3 | 3.8% |
| e | 3 | 3.8% |
| s | 3 | 3.8% |
| Other values (23) | 31 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 47 | |
| Decimal Number | 17 | 21.8% |
| Uppercase Letter | 9 | 11.5% |
| Other Punctuation | 3 | 3.8% |
| Dash Punctuation | 2 | 2.6% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 9 | |
| o | 8 | |
| d | 6 | |
| i | 4 | |
| r | 3 | 6.4% |
| e | 3 | 6.4% |
| s | 3 | 6.4% |
| t | 2 | 4.3% |
| n | 2 | 4.3% |
| m | 2 | 4.3% |
| Other values (5) | 5 |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 4 | |
| 2 | 4 | |
| 5 | 3 | |
| 0 | 2 | |
| 7 | 1 | 5.9% |
| 9 | 1 | 5.9% |
| 3 | 1 | 5.9% |
| 4 | 1 | 5.9% |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 2 | |
| T | 2 | |
| A | 1 | |
| M | 1 | |
| N | 1 | |
| E | 1 | |
| Z | 1 |
Other Punctuation
| Value | Count | Frequency (%) |
| : | 2 | |
| . | 1 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 2 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 56 | |
| Common | 22 | 28.2% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 9 | |
| o | 8 | |
| d | 6 | |
| i | 4 | 7.1% |
| r | 3 | 5.4% |
| e | 3 | 5.4% |
| s | 3 | 5.4% |
| C | 2 | 3.6% |
| t | 2 | 3.6% |
| T | 2 | 3.6% |
| Other values (12) | 14 |
Common
| Value | Count | Frequency (%) |
| 1 | 4 | |
| 2 | 4 | |
| 5 | 3 | |
| : | 2 | |
| - | 2 | |
| 0 | 2 | |
| . | 1 | 4.5% |
| 7 | 1 | 4.5% |
| 9 | 1 | 4.5% |
| 3 | 1 | 4.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 78 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 9 | 11.5% |
| o | 8 | 10.3% |
| d | 6 | 7.7% |
| 1 | 4 | 5.1% |
| i | 4 | 5.1% |
| 2 | 4 | 5.1% |
| r | 3 | 3.8% |
| 5 | 3 | 3.8% |
| e | 3 | 3.8% |
| s | 3 | 3.8% |
| Other values (23) | 31 |
speciesKey
Text
Missing 
| Distinct | 271285 |
|---|---|
| Distinct (%) | 13.2% |
| Missing | 306496 |
| Missing (%) | 13.0% |
| Memory size | 18.0 MiB |
Length
| Max length | 43 |
|---|---|
| Median length | 7 |
| Mean length | 7.026590079 |
| Min length | 5 |
Unique
| Unique | 123606 |
|---|---|
| Unique (%) | 6.0% |
Sample
| 1st row | 3044413 |
|---|---|
| 2nd row | 2431199 |
| 3rd row | 2322812 |
| 4th row | 10722387 |
| 5th row | 2429795 |
| Value | Count | Frequency (%) |
| 2431491 | 19390 | 0.9% |
| 2437967 | 4075 | 0.2% |
| 2431539 | 3260 | 0.2% |
| 2440447 | 2987 | 0.1% |
| 2431224 | 2562 | 0.1% |
| 2431506 | 2541 | 0.1% |
| 2433176 | 2143 | 0.1% |
| 2431516 | 2047 | 0.1% |
| 2438019 | 1908 | 0.1% |
| 2438655 | 1829 | 0.1% |
| Other values (271278) | 2012238 |
Most occurring characters
| Value | Count | Frequency (%) |
| 2 | 2212881 | |
| 3 | 1628160 | |
| 4 | 1530596 | |
| 1 | 1427667 | |
| 5 | 1423128 | |
| 7 | 1309681 | |
| 8 | 1277861 | |
| 9 | 1262298 | |
| 0 | 1219617 | |
| 6 | 1147471 | |
| Other values (29) | 121 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 14439360 | |
| Lowercase Letter | 104 | < 0.1% |
| Uppercase Letter | 10 | < 0.1% |
| Other Punctuation | 4 | < 0.1% |
| Space Separator | 3 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 18 | |
| e | 12 | |
| i | 10 | |
| l | 10 | |
| o | 9 | |
| d | 7 | 6.7% |
| s | 6 | 5.8% |
| r | 6 | 5.8% |
| t | 5 | 4.8% |
| h | 4 | 3.8% |
| Other values (9) | 17 |
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 2212881 | |
| 3 | 1628160 | |
| 4 | 1530596 | |
| 1 | 1427667 | |
| 5 | 1423128 | |
| 7 | 1309681 | |
| 8 | 1277861 | |
| 9 | 1262298 | |
| 0 | 1219617 | |
| 6 | 1147471 |
Uppercase Letter
| Value | Count | Frequency (%) |
| P | 3 | |
| M | 2 | |
| A | 1 | 10.0% |
| G | 1 | 10.0% |
| H | 1 | 10.0% |
| R | 1 | 10.0% |
| D | 1 | 10.0% |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 3 | |
| . | 1 | 25.0% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 3 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 14439367 | |
| Latin | 114 | < 0.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 18 | |
| e | 12 | |
| i | 10 | 8.8% |
| l | 10 | 8.8% |
| o | 9 | 7.9% |
| d | 7 | 6.1% |
| s | 6 | 5.3% |
| r | 6 | 5.3% |
| t | 5 | 4.4% |
| h | 4 | 3.5% |
| Other values (16) | 27 |
Common
| Value | Count | Frequency (%) |
| 2 | 2212881 | |
| 3 | 1628160 | |
| 4 | 1530596 | |
| 1 | 1427667 | |
| 5 | 1423128 | |
| 7 | 1309681 | |
| 8 | 1277861 | |
| 9 | 1262298 | |
| 0 | 1219617 | |
| 6 | 1147471 | |
| Other values (3) | 7 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 14439481 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 2 | 2212881 | |
| 3 | 1628160 | |
| 4 | 1530596 | |
| 1 | 1427667 | |
| 5 | 1423128 | |
| 7 | 1309681 | |
| 8 | 1277861 | |
| 9 | 1262298 | |
| 0 | 1219617 | |
| 6 | 1147471 | |
| Other values (29) | 121 | < 0.1% |
species
Text
Missing 
| Distinct | 270918 |
|---|---|
| Distinct (%) | 13.2% |
| Missing | 306502 |
| Missing (%) | 13.0% |
| Memory size | 18.0 MiB |
Length
| Max length | 41 |
|---|---|
| Median length | 35 |
| Mean length | 18.94208239 |
| Min length | 4 |
Unique
| Unique | 123378 |
|---|---|
| Unique (%) | 6.0% |
Sample
| 1st row | Paysonia lescurii |
|---|---|
| 2nd row | Desmognathus ochrophaeus |
| 3rd row | Ninoe kinbergi |
| 4th row | Hylogomphus adelphus |
| 5th row | Scaphiopus couchii |
| Value | Count | Frequency (%) |
| plethodon | 42272 | 1.0% |
| cinereus | 21325 | 0.5% |
| carex | 14397 | 0.4% |
| bombus | 13087 | 0.3% |
| peromyscus | 10009 | 0.2% |
| miconia | 9511 | 0.2% |
| desmognathus | 9016 | 0.2% |
| cladonia | 7557 | 0.2% |
| poa | 7484 | 0.2% |
| cyperus | 6932 | 0.2% |
| Other values (144983) | 3968538 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 4398026 | 11.3% |
| i | 3607850 | 9.3% |
| s | 2760190 | 7.1% |
| e | 2620738 | 6.7% |
| o | 2514893 | 6.5% |
| r | 2406574 | 6.2% |
| l | 2172600 | 5.6% |
| u | 2139863 | 5.5% |
| n | 2075734 | 5.3% |
| (space) | 2055157 | 5.3% |
| Other values (49) | 12173805 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 34810486 | |
| Space Separator | 2055157 | 5.3% |
| Uppercase Letter | 2055001 | 5.3% |
| Dash Punctuation | 4780 | < 0.1% |
| Decimal Number | 3 | < 0.1% |
| Math Symbol | 1 | < 0.1% |
| Other Punctuation | 1 | < 0.1% |
| Connector Punctuation | 1 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 4398026 | |
| i | 3607850 | |
| s | 2760190 | 7.9% |
| e | 2620738 | 7.5% |
| o | 2514893 | 7.2% |
| r | 2406574 | 6.9% |
| l | 2172600 | 6.2% |
| u | 2139863 | 6.1% |
| n | 2075734 | 6.0% |
| t | 1885466 | 5.4% |
| Other values (16) | 8228552 |
Uppercase Letter
| Value | Count | Frequency (%) |
| P | 312538 | |
| C | 268039 | |
| S | 190370 | |
| A | 188463 | |
| M | 140447 | 6.8% |
| E | 112512 | 5.5% |
| L | 109813 | 5.3% |
| D | 94012 | 4.6% |
| T | 93925 | 4.6% |
| B | 83855 | 4.1% |
| Other values (16) | 461027 |
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 2 | |
| 5 | 1 |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 2055157 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 4780 |
Math Symbol
| Value | Count | Frequency (%) |
| × | 1 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 1 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 36865487 | |
| Common | 2059943 | 5.3% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 4398026 | |
| i | 3607850 | 9.8% |
| s | 2760190 | 7.5% |
| e | 2620738 | 7.1% |
| o | 2514893 | 6.8% |
| r | 2406574 | 6.5% |
| l | 2172600 | 5.9% |
| u | 2139863 | 5.8% |
| n | 2075734 | 5.6% |
| t | 1885466 | 5.1% |
| Other values (42) | 10283553 |
Common
| Value | Count | Frequency (%) |
| (space) | 2055157 | |
| - | 4780 | 0.2% |
| 0 | 2 | < 0.1% |
| × | 1 | < 0.1% |
| 5 | 1 | < 0.1% |
| . | 1 | < 0.1% |
| _ | 1 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 38925429 | |
| None | 1 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 4398026 | 11.3% |
| i | 3607850 | 9.3% |
| s | 2760190 | 7.1% |
| e | 2620738 | 6.7% |
| o | 2514893 | 6.5% |
| r | 2406574 | 6.2% |
| l | 2172600 | 5.6% |
| u | 2139863 | 5.5% |
| n | 2075734 | 5.3% |
| (space) | 2055157 | 5.3% |
| Other values (48) | 12173804 |
None
| Value | Count | Frequency (%) |
| × | 1 |
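The percentage rows in the species overview appear to use two different denominators: Missing (%) is relative to all 2361473 observations, while Distinct (%) and Unique (%) are relative to the non-missing values only. The following is a minimal sketch that checks this reading against the figures reported above; the helper name `pct` is illustrative and not part of the profiling tool.

```python
# Figures copied from the report: dataset statistics and the species overview.
n_rows = 2361473      # Number of observations
n_missing = 306502    # Missing
n_distinct = 270918   # Distinct
n_unique = 123378     # Unique (values that occur exactly once)


def pct(part, whole):
    """Return part/whole as a percentage rounded to one decimal place."""
    return round(100.0 * part / whole, 1)


# Assumption being tested: Distinct (%) and Unique (%) exclude missing values.
n_present = n_rows - n_missing

missing_pct = pct(n_missing, n_rows)
distinct_pct = pct(n_distinct, n_present)
unique_pct = pct(n_unique, n_present)

print(missing_pct, distinct_pct, unique_pct)
```

Under this assumption the three results agree with the 13.0%, 13.2% and 6.0% shown in the species tables.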
| Distinct | 315019 |
|---|---|
| Distinct (%) | 13.4% |
| Missing | 5767 |
| Missing (%) | 0.2% |
| Memory size | 18.0 MiB |
Length
| Max length | 234 |
|---|---|
| Median length | 129 |
| Mean length | 32.19431075 |
| Min length | 4 |
Unique
| Unique | 142214 |
|---|---|
| Unique (%) | 6.0% |
Sample
| 1st row | Hippolytidae |
|---|---|
| 2nd row | Paysonia lescurii (A.Gray) O'Kane & Al-Shehbaz |
| 3rd row | Desmognathus ochrophaeus Cope, 1859 |
| 4th row | Scleractinia |
| 5th row | Ninoe kinbergi Ehlers, 1887 |
| Value | Count | Frequency (%) |
| 263619 | 2.9% | |
| l | 187761 | 2.0% |
| ex | 84339 | 0.9% |
| linnaeus | 82433 | 0.9% |
| 1758 | 64115 | 0.7% |
| plethodon | 42963 | 0.5% |
| var | 34730 | 0.4% |
| 1818 | 33708 | 0.4% |
| subsp | 33211 | 0.4% |
| kunth | 31136 | 0.3% |
| Other values (179005) | 8332858 |
Most occurring characters
| Value | Count | Frequency (%) |
| (space) | 6835167 | 9.0% |
| a | 6257797 | 8.3% |
| i | 5019804 | 6.6% |
| e | 4734764 | 6.2% |
| r | 3964580 | 5.2% |
| s | 3861386 | 5.1% |
| o | 3664229 | 4.8% |
| n | 3503274 | 4.6% |
| l | 3422094 | 4.5% |
| u | 2966100 | 3.9% |
| Other values (124) | 31611136 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 53031876 | |
| Space Separator | 6835167 | 9.0% |
| Uppercase Letter | 6180529 | 8.1% |
| Decimal Number | 4384846 | 5.8% |
| Other Punctuation | 3234472 | 4.3% |
| Open Punctuation | 1070948 | 1.4% |
| Close Punctuation | 1070948 | 1.4% |
| Dash Punctuation | 28359 | < 0.1% |
| Math Symbol | 3161 | < 0.1% |
| Connector Punctuation | 25 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 6257797 | |
| i | 5019804 | 9.5% |
| e | 4734764 | 8.9% |
| r | 3964580 | 7.5% |
| s | 3861386 | 7.3% |
| o | 3664229 | 6.9% |
| n | 3503274 | 6.6% |
| l | 3422094 | 6.5% |
| u | 2966100 | 5.6% |
| t | 2761470 | 5.2% |
| Other values (61) | 12876378 |
Uppercase Letter
| Value | Count | Frequency (%) |
| L | 588592 | 9.5% |
| S | 572550 | 9.3% |
| C | 544180 | 8.8% |
| P | 518690 | 8.4% |
| A | 420984 | 6.8% |
| M | 419308 | 6.8% |
| B | 409951 | 6.6% |
| H | 341217 | 5.5% |
| G | 326803 | 5.3% |
| D | 278325 | 4.5% |
| Other values (33) | 1759929 |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 1313369 | |
| 8 | 927854 | |
| 9 | 466939 | 10.6% |
| 7 | 355077 | 8.1% |
| 5 | 255118 | 5.8% |
| 0 | 230866 | 5.3% |
| 2 | 225110 | 5.1% |
| 6 | 220162 | 5.0% |
| 3 | 203545 | 4.6% |
| 4 | 186806 | 4.3% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 1840636 | |
| , | 1124774 | |
| & | 263619 | 8.2% |
| ' | 5443 | 0.2% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 6835167 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 1070948 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 1070948 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 28359 |
Math Symbol
| Value | Count | Frequency (%) |
| × | 3161 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 25 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 59212405 | |
| Common | 16627926 | 21.9% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 6257797 | 10.6% |
| i | 5019804 | 8.5% |
| e | 4734764 | 8.0% |
| r | 3964580 | 6.7% |
| s | 3861386 | 6.5% |
| o | 3664229 | 6.2% |
| n | 3503274 | 5.9% |
| l | 3422094 | 5.8% |
| u | 2966100 | 5.0% |
| t | 2761470 | 4.7% |
| Other values (104) | 19056907 |
Common
| Value | Count | Frequency (%) |
| (space) | 6835167 | |
| . | 1840636 | 11.1% |
| 1 | 1313369 | 7.9% |
| , | 1124774 | 6.8% |
| ( | 1070948 | 6.4% |
| ) | 1070948 | 6.4% |
| 8 | 927854 | 5.6% |
| 9 | 466939 | 2.8% |
| 7 | 355077 | 2.1% |
| & | 263619 | 1.6% |
| Other values (10) | 1358595 | 8.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 75704690 | |
| None | 135641 | 0.2% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| (space) | 6835167 | 9.0% |
| a | 6257797 | 8.3% |
| i | 5019804 | 6.6% |
| e | 4734764 | 6.3% |
| r | 3964580 | 5.2% |
| s | 3861386 | 5.1% |
| o | 3664229 | 4.8% |
| n | 3503274 | 4.6% |
| l | 3422094 | 4.5% |
| u | 2966100 | 3.9% |
| Other values (61) | 31475495 |
None
| Value | Count | Frequency (%) |
| ü | 40827 | |
| é | 28282 | |
| ö | 18079 | |
| è | 11323 | 8.3% |
| á | 5102 | 3.8% |
| ä | 4987 | 3.7% |
| å | 4932 | 3.6% |
| ø | 4642 | 3.4% |
| × | 3161 | 2.3% |
| Á | 2128 | 1.6% |
| Other values (53) | 12178 | 9.0% |
Missing 
| Distinct | 389005 |
|---|---|
| Distinct (%) | 17.2% |
| Missing | 94306 |
| Missing (%) | 4.0% |
| Memory size | 18.0 MiB |
Length
| Max length | 125 |
|---|---|
| Median length | 97 |
| Mean length | 20.1873435 |
| Min length | 3 |
Unique
| Unique | 203004 |
|---|---|
| Unique (%) | 9.0% |
Sample
| 1st row | Lesquerella lescurii |
|---|---|
| 2nd row | Desmognathus ochrophaeus |
| 3rd row | Ninoe kinbergi |
| 4th row | Gomphus adelphus |
| 5th row | Skrjabinoclava catoptrophori |
| Value | Count | Frequency (%) |
| sp | 138550 | 2.8% |
| var | 54090 | 1.1% |
| plethodon | 42963 | 0.9% |
| subsp | 26921 | 0.5% |
| cinereus | 21966 | 0.4% |
| bombus | 17610 | 0.4% |
| carex | 14678 | 0.3% |
| indet | 10551 | 0.2% |
| peromyscus | 10026 | 0.2% |
| desmognathus | 9258 | 0.2% |
| Other values (177028) | 4602653 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 5045324 | 11.0% |
| i | 4145959 | 9.1% |
| s | 3377285 | 7.4% |
| e | 3010560 | 6.6% |
| o | 2851325 | 6.2% |
| r | 2816880 | 6.2% |
| (space) | 2682099 | 5.9% |
| u | 2473140 | 5.4% |
| l | 2459207 | 5.4% |
| n | 2395933 | 5.2% |
| Other values (84) | 14510367 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 40393006 | |
| Space Separator | 2682099 | 5.9% |
| Uppercase Letter | 2329247 | 5.1% |
| Other Punctuation | 248057 | 0.5% |
| Open Punctuation | 53941 | 0.1% |
| Close Punctuation | 53940 | 0.1% |
| Dash Punctuation | 5649 | < 0.1% |
| Decimal Number | 2054 | < 0.1% |
| Connector Punctuation | 78 | < 0.1% |
| Math Symbol | 8 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 5045324 | |
| i | 4145959 | |
| s | 3377285 | 8.4% |
| e | 3010560 | 7.5% |
| o | 2851325 | 7.1% |
| r | 2816880 | 7.0% |
| u | 2473140 | 6.1% |
| l | 2459207 | 6.1% |
| n | 2395933 | 5.9% |
| t | 2138803 | 5.3% |
| Other values (27) | 9678590 |
Uppercase Letter
| Value | Count | Frequency (%) |
| P | 347226 | |
| C | 308396 | |
| A | 214038 | 9.2% |
| S | 207550 | 8.9% |
| M | 156578 | 6.7% |
| L | 125199 | 5.4% |
| E | 119161 | 5.1% |
| T | 112361 | 4.8% |
| D | 105101 | 4.5% |
| B | 97827 | 4.2% |
| Other values (18) | 535810 |
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 597 | |
| 1 | 490 | |
| 0 | 442 | |
| 5 | 259 | |
| 9 | 66 | 3.2% |
| 8 | 53 | 2.6% |
| 3 | 48 | 2.3% |
| 7 | 42 | 2.0% |
| 4 | 33 | 1.6% |
| 6 | 24 | 1.2% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 242373 | |
| " | 2356 | 0.9% |
| , | 1333 | 0.5% |
| ' | 1120 | 0.5% |
| & | 607 | 0.2% |
| ? | 178 | 0.1% |
| / | 75 | < 0.1% |
| # | 14 | < 0.1% |
| ; | 1 | < 0.1% |
Math Symbol
| Value | Count | Frequency (%) |
| + | 3 | |
| × | 3 | |
| ~ | 2 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 53920 | |
| [ | 21 | < 0.1% |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 53919 | |
| ] | 21 | < 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 2682099 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 5649 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 78 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 42722253 | |
| Common | 3045826 | 6.7% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 5045324 | |
| i | 4145959 | 9.7% |
| s | 3377285 | 7.9% |
| e | 3010560 | 7.0% |
| o | 2851325 | 6.7% |
| r | 2816880 | 6.6% |
| u | 2473140 | 5.8% |
| l | 2459207 | 5.8% |
| n | 2395933 | 5.6% |
| t | 2138803 | 5.0% |
| Other values (55) | 12007837 |
Common
| Value | Count | Frequency (%) |
| (space) | 2682099 | |
| . | 242373 | 8.0% |
| ( | 53920 | 1.8% |
| ) | 53919 | 1.8% |
| - | 5649 | 0.2% |
| " | 2356 | 0.1% |
| , | 1333 | < 0.1% |
| ' | 1120 | < 0.1% |
| & | 607 | < 0.1% |
| 2 | 597 | < 0.1% |
| Other values (19) | 1853 | 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 45767767 | |
| None | 312 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 5045324 | 11.0% |
| i | 4145959 | 9.1% |
| s | 3377285 | 7.4% |
| e | 3010560 | 6.6% |
| o | 2851325 | 6.2% |
| r | 2816880 | 6.2% |
| (space) | 2682099 | 5.9% |
| u | 2473140 | 5.4% |
| l | 2459207 | 5.4% |
| n | 2395933 | 5.2% |
| Other values (70) | 14510055 |
None
| Value | Count | Frequency (%) |
| ë | 184 | |
| ö | 38 | 12.2% |
| ü | 28 | 9.0% |
| á | 20 | 6.4% |
| Á | 16 | 5.1% |
| é | 11 | 3.5% |
| ó | 4 | 1.3% |
| × | 3 | 1.0% |
| É | 2 | 0.6% |
| ñ | 2 | 0.6% |
| Other values (4) | 4 | 1.3% |
typifiedName
Text
Missing 
| Distinct | 2 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 2361471 |
| Missing (%) | > 99.9% |
| Memory size | 18.0 MiB |
Length
| Max length | 13 |
|---|---|
| Median length | 10.5 |
| Mean length | 10.5 |
| Min length | 8 |
Unique
| Unique | 2 |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | French Guiana |
|---|---|
| 2nd row | Malvales |
| Value | Count | Frequency (%) |
| french | 1 | |
| guiana | 1 | |
| malvales | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 4 | |
| e | 2 | 9.5% |
| n | 2 | 9.5% |
| l | 2 | 9.5% |
| F | 1 | 4.8% |
| r | 1 | 4.8% |
| c | 1 | 4.8% |
| h | 1 | 4.8% |
| (space) | 1 | 4.8% |
| G | 1 | 4.8% |
| Other values (5) | 5 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 17 | |
| Uppercase Letter | 3 | 14.3% |
| Space Separator | 1 | 4.8% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 4 | |
| e | 2 | |
| n | 2 | |
| l | 2 | |
| r | 1 | 5.9% |
| c | 1 | 5.9% |
| h | 1 | 5.9% |
| u | 1 | 5.9% |
| i | 1 | 5.9% |
| v | 1 | 5.9% |
Uppercase Letter
| Value | Count | Frequency (%) |
| F | 1 | |
| G | 1 | |
| M | 1 |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 20 | |
| Common | 1 | 4.8% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 4 | |
| e | 2 | |
| n | 2 | |
| l | 2 | |
| F | 1 | 5.0% |
| r | 1 | 5.0% |
| c | 1 | 5.0% |
| h | 1 | 5.0% |
| G | 1 | 5.0% |
| u | 1 | 5.0% |
| Other values (4) | 4 |
Common
| Value | Count | Frequency (%) |
| (space) | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 21 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 4 | |
| e | 2 | 9.5% |
| n | 2 | 9.5% |
| l | 2 | 9.5% |
| F | 1 | 4.8% |
| r | 1 | 4.8% |
| c | 1 | 4.8% |
| h | 1 | 4.8% |
| (space) | 1 | 4.8% |
| G | 1 | 4.8% |
| Other values (5) | 5 |
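The length, character and category figures for typifiedName are small enough to reproduce by hand. The sketch below assumes the two non-missing values are exactly the sample rows shown above ("French Guiana" and "Malvales") and tallies Unicode general categories with Python's standard `unicodedata` module. It is an illustration of how such counts can be derived, not the profiler's own implementation; the `LABELS` mapping is introduced here only to match the report's wording.

```python
import statistics
import unicodedata
from collections import Counter

# The two non-missing typifiedName values, as listed under Sample.
values = ["French Guiana", "Malvales"]

# Two-letter Unicode general categories mapped to the labels used in the report.
LABELS = {
    "Ll": "Lowercase Letter",
    "Lu": "Uppercase Letter",
    "Zs": "Space Separator",
}

chars = "".join(values)

# Count characters per general category, falling back to the raw code.
category_counts = Counter(
    LABELS.get(unicodedata.category(c), unicodedata.category(c)) for c in chars
)
char_counts = Counter(chars)
lengths = [len(v) for v in values]

print(dict(category_counts))
print(char_counts.most_common(1))
print(max(lengths), statistics.median(lengths), statistics.mean(lengths), min(lengths))
```

With these two values the tallies match the tables above: 17 lowercase letters, 3 uppercase letters and 1 space across 21 characters, with lengths 13 and 8 giving a mean and median of 10.5.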
protocol
Text
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 11 |
| Missing (%) | < 0.1% |
| Memory size | 18.0 MiB |
Length
| Max length | 48 |
|---|---|
| Median length | 3 |
| Mean length | 3.00002075 |
| Min length | 3 |
Unique
| Unique | 2 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | EML |
|---|---|
| 2nd row | EML |
| 3rd row | EML |
| 4th row | EML |
| 5th row | EML |
| Value | Count | Frequency (%) |
| eml | 2361460 | |
| guf.1_1 | 1 | < 0.1% |
| occurrence_status_inferred_from_individual_count | 1 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| E | 2361464 | |
| L | 2361461 | |
| M | 2361461 | |
| _ | 6 | < 0.1% |
| R | 5 | < 0.1% |
| U | 5 | < 0.1% |
| I | 4 | < 0.1% |
| N | 4 | < 0.1% |
| C | 4 | < 0.1% |
| D | 3 | < 0.1% |
| Other values (9) | 18 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 7084426 | |
| Connector Punctuation | 6 | < 0.1% |
| Decimal Number | 2 | < 0.1% |
| Other Punctuation | 1 | < 0.1% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| E | 2361464 | |
| L | 2361461 | |
| M | 2361461 | |
| R | 5 | < 0.1% |
| U | 5 | < 0.1% |
| I | 4 | < 0.1% |
| N | 4 | < 0.1% |
| C | 4 | < 0.1% |
| D | 3 | < 0.1% |
| T | 3 | < 0.1% |
| Other values (6) | 12 | < 0.1% |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 6 |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 2 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 7084426 | |
| Common | 9 | < 0.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| E | 2361464 | |
| L | 2361461 | |
| M | 2361461 | |
| R | 5 | < 0.1% |
| U | 5 | < 0.1% |
| I | 4 | < 0.1% |
| N | 4 | < 0.1% |
| C | 4 | < 0.1% |
| D | 3 | < 0.1% |
| T | 3 | < 0.1% |
| Other values (6) | 12 | < 0.1% |
Common
| Value | Count | Frequency (%) |
| _ | 6 | |
| 1 | 2 | 22.2% |
| . | 1 | 11.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 7084435 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| E | 2361464 | |
| L | 2361461 | |
| M | 2361461 | |
| _ | 6 | < 0.1% |
| R | 5 | < 0.1% |
| U | 5 | < 0.1% |
| I | 4 | < 0.1% |
| N | 4 | < 0.1% |
| C | 4 | < 0.1% |
| D | 3 | < 0.1% |
| Other values (9) | 18 | < 0.1% |
lastParsed
Text
| Distinct | 210769 |
|---|---|
| Distinct (%) | 8.9% |
| Missing | 4 |
| Missing (%) | < 0.1% |
| Memory size | 18.0 MiB |
Length
| Max length | 24 |
|---|---|
| Median length | 24 |
| Mean length | 23.99603679 |
| Min length | 7 |
Unique
| Unique | 7665 |
|---|---|
| Unique (%) | 0.3% |
Sample
| 1st row | 2024-12-02T13:59:36.683Z |
|---|---|
| 2nd row | 2024-12-02T13:59:14.817Z |
| 3rd row | 2024-12-02T13:57:42.802Z |
| 4th row | 2024-12-02T13:59:13.837Z |
| 5th row | 2024-12-02T13:57:45.358Z |
| Value | Count | Frequency (%) |
| 2024-12-02t13:57:25.039z | 46 | < 0.1% |
| 2024-12-02t13:57:24.083z | 45 | < 0.1% |
| 2024-12-02t13:57:45.003z | 45 | < 0.1% |
| 2024-12-02t13:57:28.833z | 45 | < 0.1% |
| 2024-12-02t13:57:34.491z | 44 | < 0.1% |
| 2024-12-02t13:57:52.915z | 44 | < 0.1% |
| 2024-12-02t13:57:52.924z | 43 | < 0.1% |
| 2024-12-02t13:57:43.166z | 43 | < 0.1% |
| 2024-12-02t13:57:52.893z | 42 | < 0.1% |
| 2024-12-02t13:57:42.743z | 42 | < 0.1% |
| Other values (210759) | 2361030 |
Most occurring characters
| Value | Count | Frequency (%) |
| 2 | 10789973 | |
| 0 | 5989862 | |
| 1 | 5962965 | |
| : | 4722920 | |
| - | 4722920 | |
| 4 | 3794952 | 6.7% |
| 5 | 3740547 | 6.6% |
| 3 | 3738231 | 6.6% |
| T | 2361460 | 4.2% |
| Z | 2361460 | 4.2% |
| Other values (32) | 8480607 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 40137896 | |
| Other Punctuation | 7082072 | 12.5% |
| Uppercase Letter | 4722930 | 8.3% |
| Dash Punctuation | 4722920 | 8.3% |
| Lowercase Letter | 79 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 10 | |
| e | 10 | |
| i | 9 | |
| n | 7 | |
| c | 6 | |
| u | 5 | 6.3% |
| l | 5 | 6.3% |
| r | 5 | 6.3% |
| t | 4 | 5.1% |
| m | 4 | 5.1% |
| Other values (9) | 14 |
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 10789973 | |
| 0 | 5989862 | |
| 1 | 5962965 | |
| 4 | 3794952 | 9.5% |
| 5 | 3740547 | 9.3% |
| 3 | 3738231 | 9.3% |
| 7 | 1827732 | 4.6% |
| 9 | 1516897 | 3.8% |
| 6 | 1417013 | 3.5% |
| 8 | 1359724 | 3.4% |
Uppercase Letter
| Value | Count | Frequency (%) |
| T | 2361460 | |
| Z | 2361460 | |
| M | 2 | < 0.1% |
| P | 2 | < 0.1% |
| H | 1 | < 0.1% |
| U | 1 | < 0.1% |
| C | 1 | < 0.1% |
| S | 1 | < 0.1% |
| L | 1 | < 0.1% |
| I | 1 | < 0.1% |
Other Punctuation
| Value | Count | Frequency (%) |
| : | 4722920 | |
| . | 2359152 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 4722920 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 51942888 | |
| Latin | 4723009 | 8.3% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| T | 2361460 | |
| Z | 2361460 | |
| a | 10 | < 0.1% |
| e | 10 | < 0.1% |
| i | 9 | < 0.1% |
| n | 7 | < 0.1% |
| c | 6 | < 0.1% |
| u | 5 | < 0.1% |
| l | 5 | < 0.1% |
| r | 5 | < 0.1% |
| Other values (19) | 32 | < 0.1% |
Common
| Value | Count | Frequency (%) |
| 2 | 10789973 | |
| 0 | 5989862 | |
| 1 | 5962965 | |
| : | 4722920 | |
| - | 4722920 | |
| 4 | 3794952 | 7.3% |
| 5 | 3740547 | 7.2% |
| 3 | 3738231 | 7.2% |
| . | 2359152 | 4.5% |
| 7 | 1827732 | 3.5% |
| Other values (3) | 4293634 | 8.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 56665897 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 2 | 10789973 | |
| 0 | 5989862 | |
| 1 | 5962965 | |
| : | 4722920 | |
| - | 4722920 | |
| 4 | 3794952 | 6.7% |
| 5 | 3740547 | 6.6% |
| 3 | 3738231 | 6.6% |
| T | 2361460 | 4.2% |
| Z | 2361460 | 4.2% |
| Other values (32) | 8480607 |
lastCrawled
Text
| Distinct | 9 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 5 |
| Missing (%) | < 0.1% |
| Memory size | 18.0 MiB |
Length
| Max length | 24 |
|---|---|
| Median length | 24 |
| Mean length | 23.99995045 |
| Min length | 5 |
Unique
| Unique | 8 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | 2024-12-02T11:48:23.416Z |
|---|---|
| 2nd row | 2024-12-02T11:48:23.416Z |
| 3rd row | 2024-12-02T11:48:23.416Z |
| 4th row | 2024-12-02T11:48:23.416Z |
| 5th row | 2024-12-02T11:48:23.416Z |
| Value | Count | Frequency (%) |
| 2024-12-02t11:48:23.416z | 2361460 | |
| uncinaria | 1 | < 0.1% |
| haemoproteus | 1 | < 0.1% |
| phyllobothrium | 1 | < 0.1% |
| guf.1.11_1 | 1 | < 0.1% |
| distomum | 1 | < 0.1% |
| senecio | 1 | < 0.1% |
| false | 1 | < 0.1% |
| merluccius | 1 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| 2 | 11807300 | |
| 1 | 9445844 | |
| 4 | 7084380 | |
| - | 4722920 | 8.3% |
| : | 4722920 | 8.3% |
| 0 | 4722920 | 8.3% |
| . | 2361462 | 4.2% |
| T | 2361460 | 4.2% |
| 8 | 2361460 | 4.2% |
| 3 | 2361460 | 4.2% |
| Other values (28) | 4722989 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 40144824 | |
| Other Punctuation | 7084382 | 12.5% |
| Uppercase Letter | 4722929 | 8.3% |
| Dash Punctuation | 4722920 | 8.3% |
| Lowercase Letter | 59 | < 0.1% |
| Connector Punctuation | 1 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| i | 6 | |
| o | 6 | |
| e | 6 | |
| u | 5 | |
| c | 4 | 6.8% |
| r | 4 | 6.8% |
| m | 4 | 6.8% |
| s | 4 | 6.8% |
| a | 4 | 6.8% |
| l | 4 | 6.8% |
| Other values (7) | 12 |
Uppercase Letter
| Value | Count | Frequency (%) |
| T | 2361460 | |
| Z | 2361460 | |
| U | 2 | < 0.1% |
| F | 1 | < 0.1% |
| S | 1 | < 0.1% |
| D | 1 | < 0.1% |
| P | 1 | < 0.1% |
| G | 1 | < 0.1% |
| H | 1 | < 0.1% |
| M | 1 | < 0.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 11807300 | |
| 1 | 9445844 | |
| 4 | 7084380 | |
| 0 | 4722920 | 11.8% |
| 8 | 2361460 | 5.9% |
| 3 | 2361460 | 5.9% |
| 6 | 2361460 | 5.9% |
Other Punctuation
| Value | Count | Frequency (%) |
| : | 4722920 | |
| . | 2361462 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 4722920 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 51952127 | |
| Latin | 4722988 | 8.3% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| T | 2361460 | |
| Z | 2361460 | |
| i | 6 | < 0.1% |
| o | 6 | < 0.1% |
| e | 6 | < 0.1% |
| u | 5 | < 0.1% |
| c | 4 | < 0.1% |
| r | 4 | < 0.1% |
| m | 4 | < 0.1% |
| s | 4 | < 0.1% |
| Other values (17) | 29 | < 0.1% |
Common
| Value | Count | Frequency (%) |
| 2 | 11807300 | |
| 1 | 9445844 | |
| 4 | 7084380 | |
| - | 4722920 | 9.1% |
| : | 4722920 | 9.1% |
| 0 | 4722920 | 9.1% |
| . | 2361462 | 4.5% |
| 8 | 2361460 | 4.5% |
| 3 | 2361460 | 4.5% |
| 6 | 2361460 | 4.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 56675115 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 2 | 11807300 | |
| 1 | 9445844 | |
| 4 | 7084380 | |
| - | 4722920 | 8.3% |
| : | 4722920 | 8.3% |
| 0 | 4722920 | 8.3% |
| . | 2361462 | 4.2% |
| T | 2361460 | 4.2% |
| 8 | 2361460 | 4.2% |
| 3 | 2361460 | 4.2% |
| Other values (28) | 4722989 |
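lastCrawled is expected to hold UTC timestamps, yet eight of its nine distinct values are taxon names or other stray tokens, which may indicate that a handful of rows were shifted into the wrong column. A minimal sketch of a check that separates well-formed timestamps from such strays follows. The regular expression and the helper name `is_timestamp` are assumptions made for illustration, and the stray values are copied from the frequency table above, which lists them lowercased.

```python
import re

# UTC timestamp with optional fractional seconds, e.g. 2024-12-02T11:48:23.416Z
TIMESTAMP = re.compile(r"\d{4}-\d{2}-\d{2}T\d{2}:\d{2}:\d{2}(\.\d+)?Z")


def is_timestamp(value):
    """Return True when the whole string matches the timestamp pattern."""
    return TIMESTAMP.fullmatch(value) is not None


# One well-formed value from Sample plus the eight strays from the value table.
values = [
    "2024-12-02T11:48:23.416Z",
    "uncinaria",
    "haemoproteus",
    "phyllobothrium",
    "guf.1.11_1",
    "distomum",
    "senecio",
    "false",
    "merluccius",
]

strays = [v for v in values if not is_timestamp(v)]
print(len(strays), strays)
```

The fractional part is optional in the pattern because the lastParsed character counts above show slightly fewer "." than "T" characters, so some timestamps in this export seem to omit milliseconds.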
repatriated
Text
Missing 
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 92313 |
| Missing (%) | 3.9% |
| Memory size | 18.0 MiB |
Length
| Max length | 10 |
|---|---|
| Median length | 4 |
| Mean length | 4.372713251 |
| Min length | 4 |
Unique
| Unique | 1 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | true |
|---|---|
| 2nd row | false |
| 3rd row | false |
| 4th row | false |
| 5th row | false |
| Value | Count | Frequency (%) |
| true | 1423419 | |
| false | 845740 | |
| saint-elie | 1 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 2269160 | |
| t | 1423420 | |
| r | 1423419 | |
| u | 1423419 | |
| a | 845741 | 8.5% |
| l | 845741 | 8.5% |
| f | 845740 | 8.5% |
| s | 845740 | 8.5% |
| i | 2 | < 0.1% |
| S | 1 | < 0.1% |
| Other values (3) | 3 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 9922383 | |
| Uppercase Letter | 2 | < 0.1% |
| Dash Punctuation | 1 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 2269160 | |
| t | 1423420 | |
| r | 1423419 | |
| u | 1423419 | |
| a | 845741 | 8.5% |
| l | 845741 | 8.5% |
| f | 845740 | 8.5% |
| s | 845740 | 8.5% |
| i | 2 | < 0.1% |
| n | 1 | < 0.1% |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 1 | |
| E | 1 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 9922385 | |
| Common | 1 | < 0.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 2269160 | |
| t | 1423420 | |
| r | 1423419 | |
| u | 1423419 | |
| a | 845741 | 8.5% |
| l | 845741 | 8.5% |
| f | 845740 | 8.5% |
| s | 845740 | 8.5% |
| i | 2 | < 0.1% |
| S | 1 | < 0.1% |
| Other values (2) | 2 | < 0.1% |
Common
| Value | Count | Frequency (%) |
| - | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 9922386 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 2269160 | |
| t | 1423420 | |
| r | 1423419 | |
| u | 1423419 | |
| a | 845741 | 8.5% |
| l | 845741 | 8.5% |
| f | 845740 | 8.5% |
| s | 845740 | 8.5% |
| i | 2 | < 0.1% |
| S | 1 | < 0.1% |
| Other values (3) | 3 | < 0.1% |
Constant  Missing 
| Distinct | 1 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 2361472 |
| Missing (%) | > 99.9% |
| Memory size | 18.0 MiB |
Length
| Max length | 7 |
|---|---|
| Median length | 7 |
| Mean length | 7 |
| Min length | 7 |
Unique
| Unique | 1 |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | 3034046 |
|---|
| Value | Count | Frequency (%) |
| 3034046 | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| 3 | 2 | |
| 0 | 2 | |
| 4 | 2 | |
| 6 | 1 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 7 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 3 | 2 | |
| 0 | 2 | |
| 4 | 2 | |
| 6 | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 7 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 3 | 2 | |
| 0 | 2 | |
| 4 | 2 | |
| 6 | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 7 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 3 | 2 | |
| 0 | 2 | |
| 4 | 2 | |
| 6 | 1 |
projectId
Text
Missing 
| Distinct | 6 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 2361467 |
| Missing (%) | > 99.9% |
| Memory size | 18.0 MiB |
Length
| Max length | 11 |
|---|---|
| Median length | 9 |
| Mean length | 8 |
| Min length | 5 |
Unique
| Unique | 6 |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | caudatum |
|---|---|
| 2nd row | vibex |
| 3rd row | Sphaeralcea |
| 4th row | blumeri |
| 5th row | 3034046 |
| Value | Count | Frequency (%) |
| caudatum | 1 | |
| vibex | 1 | |
| sphaeralcea | 1 | |
| blumeri | 1 | |
| 3034046 | 1 | |
| bilinearis | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 6 | |
| i | 5 | 10.4% |
| e | 5 | 10.4% |
| r | 3 | 6.2% |
| u | 3 | 6.2% |
| b | 3 | 6.2% |
| l | 3 | 6.2% |
| c | 2 | 4.2% |
| m | 2 | 4.2% |
| 4 | 2 | 4.2% |
| Other values (12) | 14 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 40 | |
| Decimal Number | 7 | 14.6% |
| Uppercase Letter | 1 | 2.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 6 | |
| i | 5 | |
| e | 5 | |
| r | 3 | |
| u | 3 | |
| b | 3 | |
| l | 3 | |
| c | 2 | 5.0% |
| m | 2 | 5.0% |
| n | 1 | 2.5% |
| Other values (7) | 7 |
Decimal Number
| Value | Count | Frequency (%) |
| 4 | 2 | |
| 0 | 2 | |
| 3 | 2 | |
| 6 | 1 |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 41 | |
| Common | 7 | 14.6% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 6 | |
| i | 5 | |
| e | 5 | |
| r | 3 | 7.3% |
| u | 3 | 7.3% |
| b | 3 | 7.3% |
| l | 3 | 7.3% |
| c | 2 | 4.9% |
| m | 2 | 4.9% |
| n | 1 | 2.4% |
| Other values (8) | 8 |
Common
| Value | Count | Frequency (%) |
| 4 | 2 | |
| 0 | 2 | |
| 3 | 2 | |
| 6 | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 48 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 6 | |
| i | 5 | 10.4% |
| e | 5 | 10.4% |
| r | 3 | 6.2% |
| u | 3 | 6.2% |
| b | 3 | 6.2% |
| l | 3 | 6.2% |
| c | 2 | 4.2% |
| m | 2 | 4.2% |
| 4 | 2 | 4.2% |
| Other values (12) | 14 |
isSequenced
Text
| Distinct | 5 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 10 |
| Missing (%) | < 0.1% |
| Memory size | 18.0 MiB |
Length
| Max length | 11 |
|---|---|
| Median length | 5 |
| Mean length | 4.998686408 |
| Min length | 1 |
Unique
| Unique | 3 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | false |
|---|---|
| 2nd row | false |
| 3rd row | false |
| 4th row | false |
| 5th row | false |
| Value | Count | Frequency (%) |
| false | 2358359 | |
| true | 3101 | 0.1% |
| lc | 1 | < 0.1% |
| sphaeralcea | 1 | < 0.1% |
| 6 | 1 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 2361462 | |
| a | 2358362 | |
| l | 2358360 | |
| f | 2358359 | |
| s | 2358359 | |
| r | 3102 | < 0.1% |
| t | 3101 | < 0.1% |
| u | 3101 | < 0.1% |
| L | 1 | < 0.1% |
| C | 1 | < 0.1% |
| Other values (5) | 5 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 11804209 | |
| Uppercase Letter | 3 | < 0.1% |
| Decimal Number | 1 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 2361462 | |
| a | 2358362 | |
| l | 2358360 | |
| f | 2358359 | |
| s | 2358359 | |
| r | 3102 | < 0.1% |
| t | 3101 | < 0.1% |
| u | 3101 | < 0.1% |
| p | 1 | < 0.1% |
| h | 1 | < 0.1% |
Uppercase Letter
| Value | Count | Frequency (%) |
| L | 1 | |
| C | 1 | |
| S | 1 |
Decimal Number
| Value | Count | Frequency (%) |
| 6 | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 11804212 | |
| Common | 1 | < 0.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 2361462 | |
| a | 2358362 | |
| l | 2358360 | |
| f | 2358359 | |
| s | 2358359 | |
| r | 3102 | < 0.1% |
| t | 3101 | < 0.1% |
| u | 3101 | < 0.1% |
| L | 1 | < 0.1% |
| C | 1 | < 0.1% |
| Other values (4) | 4 | < 0.1% |
Common
| Value | Count | Frequency (%) |
| 6 | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 11804213 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 2361462 | |
| a | 2358362 | |
| l | 2358360 | |
| f | 2358359 | |
| s | 2358359 | |
| r | 3102 | < 0.1% |
| t | 3101 | < 0.1% |
| u | 3101 | < 0.1% |
| L | 1 | < 0.1% |
| C | 1 | < 0.1% |
| Other values (5) | 5 | < 0.1% |
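The category tallies above can be approximated with Python's standard library. The sketch below assumes the report groups characters by Unicode general category; the helper name `category_counts`, the label mapping, and the sample values are illustrative and not taken from the profiler itself.

```python
import unicodedata
from collections import Counter

# Illustrative mapping from Unicode general-category codes to the labels
# used in the tables; only categories that appear in this report are listed.
CATEGORY_LABELS = {
    "Ll": "Lowercase Letter",
    "Lu": "Uppercase Letter",
    "Nd": "Decimal Number",
    "Pc": "Connector Punctuation",
    "Po": "Other Punctuation",
    "Pd": "Dash Punctuation",
    "Ps": "Open Punctuation",
    "Pe": "Close Punctuation",
    "Pf": "Final Punctuation",
    "Zs": "Space Separator",
    "Sm": "Math Symbol",
    "Sk": "Modifier Symbol",
}

def category_counts(values):
    """Count characters per Unicode general category across all values."""
    counts = Counter()
    for value in values:
        for ch in value:
            code = unicodedata.category(ch)
            # Fall back to the raw two-letter code for unmapped categories.
            counts[CATEGORY_LABELS.get(code, code)] += 1
    return counts

# Hypothetical sample values, not rows from the dataset.
sample = ["false", "true", "LC", "6"]
print(category_counts(sample))
```

Running the same function over a whole column (with missing values dropped first) should give totals comparable to the "Most occurring categories" table, provided the profiler counts characters the same way.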
gbifRegion
Text
Missing 
| Distinct | 8 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 114374 |
| Missing (%) | 4.8% |
| Memory size | 18.0 MiB |
Length
| Max length | 13 |
|---|---|
| Median length | 13 |
| Mean length | 10.97984156 |
| Min length | 4 |
Unique
| Unique | 1 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | LATIN_AMERICA |
|---|---|
| 2nd row | NORTH_AMERICA |
| 3rd row | NORTH_AMERICA |
| 4th row | NORTH_AMERICA |
| 5th row | NORTH_AMERICA |
| Value | Count | Frequency (%) |
| north_america | 899421 | |
| latin_america | 745190 | |
| asia | 257467 | 11.5% |
| oceania | 127573 | 5.7% |
| africa | 108539 | 4.8% |
| europe | 92588 | 4.1% |
| antarctica | 16320 | 0.7% |
| 7707728 | 1 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| A | 5070530 | |
| I | 2899700 | |
| R | 2761479 | |
| E | 1957360 | 7.9% |
| C | 1913363 | 7.8% |
| N | 1788504 | 7.2% |
| T | 1677251 | 6.8% |
| _ | 1644611 | 6.7% |
| M | 1644611 | 6.7% |
| O | 1119582 | 4.5% |
| Other values (10) | 2195800 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 23028173 | |
| Connector Punctuation | 1644611 | 6.7% |
| Decimal Number | 7 | < 0.1% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 5070530 | |
| I | 2899700 | |
| R | 2761479 | |
| E | 1957360 | 8.5% |
| C | 1913363 | 8.3% |
| N | 1788504 | 7.8% |
| T | 1677251 | 7.3% |
| M | 1644611 | 7.1% |
| O | 1119582 | 4.9% |
| H | 899421 | 3.9% |
| Other values (5) | 1296372 | 5.6% |
Decimal Number
| Value | Count | Frequency (%) |
| 7 | 4 | |
| 0 | 1 | 14.3% |
| 2 | 1 | 14.3% |
| 8 | 1 | 14.3% |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 1644611 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 23028173 | |
| Common | 1644618 | 6.7% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| A | 5070530 | |
| I | 2899700 | |
| R | 2761479 | |
| E | 1957360 | 8.5% |
| C | 1913363 | 8.3% |
| N | 1788504 | 7.8% |
| T | 1677251 | 7.3% |
| M | 1644611 | 7.1% |
| O | 1119582 | 4.9% |
| H | 899421 | 3.9% |
| Other values (5) | 1296372 | 5.6% |
Common
| Value | Count | Frequency (%) |
| _ | 1644611 | |
| 7 | 4 | < 0.1% |
| 0 | 1 | < 0.1% |
| 2 | 1 | < 0.1% |
| 8 | 1 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 24672791 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| A | 5070530 | |
| I | 2899700 | |
| R | 2761479 | |
| E | 1957360 | 7.9% |
| C | 1913363 | 7.8% |
| N | 1788504 | 7.2% |
| T | 1677251 | 6.8% |
| _ | 1644611 | 6.7% |
| M | 1644611 | 6.7% |
| O | 1119582 | 4.5% |
| Other values (10) | 2195800 |
publishedByGbifRegion
Text
| Distinct | 4 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 6 |
| Missing (%) | < 0.1% |
| Memory size | 18.0 MiB |
Length
| Max length | 13 |
|---|---|
| Median length | 13 |
| Mean length | 12.99997883 |
| Min length | 3 |
Unique
| Unique | 1 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | NORTH_AMERICA |
|---|---|
| 2nd row | NORTH_AMERICA |
| 3rd row | NORTH_AMERICA |
| 4th row | NORTH_AMERICA |
| 5th row | NORTH_AMERICA |
| Value | Count | Frequency (%) |
| north_america | 2361460 | |
| species | 4 | < 0.1% |
| genus | 2 | < 0.1% |
| 220 | 1 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| A | 4722920 | |
| R | 4722920 | |
| E | 2361470 | |
| C | 2361464 | |
| I | 2361464 | |
| N | 2361462 | |
| _ | 2361460 | |
| M | 2361460 | |
| O | 2361460 | |
| H | 2361460 | |
| Other values (7) | 2361481 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 28337558 | |
| Connector Punctuation | 2361460 | 7.7% |
| Decimal Number | 3 | < 0.1% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 4722920 | |
| R | 4722920 | |
| E | 2361470 | |
| C | 2361464 | |
| I | 2361464 | |
| N | 2361462 | |
| M | 2361460 | |
| O | 2361460 | |
| H | 2361460 | |
| T | 2361460 | |
| Other values (4) | 18 | < 0.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 2 | |
| 0 | 1 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 2361460 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 28337558 | |
| Common | 2361463 | 7.7% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| A | 4722920 | |
| R | 4722920 | |
| E | 2361470 | |
| C | 2361464 | |
| I | 2361464 | |
| N | 2361462 | |
| M | 2361460 | |
| O | 2361460 | |
| H | 2361460 | |
| T | 2361460 | |
| Other values (4) | 18 | < 0.1% |
Common
| Value | Count | Frequency (%) |
| _ | 2361460 | |
| 2 | 2 | < 0.1% |
| 0 | 1 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 30699021 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| A | 4722920 | |
| R | 4722920 | |
| E | 2361470 | |
| C | 2361464 | |
| I | 2361464 | |
| N | 2361462 | |
| _ | 2361460 | |
| M | 2361460 | |
| O | 2361460 | |
| H | 2361460 | |
| Other values (7) | 2361481 |
level0Gid
Text
Missing 
| Distinct | 239 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 1911133 |
| Missing (%) | 80.9% |
| Memory size | 18.0 MiB |
Length
| Max length | 7 |
|---|---|
| Median length | 3 |
| Mean length | 3.000008882 |
| Min length | 3 |
Unique
| Unique | 6 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | USA |
|---|---|
| 2nd row | USA |
| 3rd row | USA |
| 4th row | USA |
| 5th row | CRI |
| Value | Count | Frequency (%) |
| usa | 191983 | |
| ven | 20119 | 4.5% |
| bra | 20003 | 4.4% |
| guy | 18195 | 4.0% |
| mex | 16475 | 3.7% |
| ecu | 12944 | 2.9% |
| per | 9844 | 2.2% |
| can | 9240 | 2.1% |
| pan | 6074 | 1.3% |
| bol | 6000 | 1.3% |
| Other values (229) | 139463 |
Most occurring characters
| Value | Count | Frequency (%) |
| A | 260823 | |
| U | 244099 | |
| S | 209561 | |
| E | 69205 | 5.1% |
| N | 67199 | 5.0% |
| R | 56601 | 4.2% |
| C | 47627 | 3.5% |
| G | 44616 | 3.3% |
| M | 43317 | 3.2% |
| B | 38463 | 2.8% |
| Other values (28) | 269513 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 1350250 | |
| Decimal Number | 767 | 0.1% |
| Lowercase Letter | 7 | < 0.1% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 260823 | |
| U | 244099 | |
| S | 209561 | |
| E | 69205 | 5.1% |
| N | 67199 | 5.0% |
| R | 56601 | 4.2% |
| C | 47627 | 3.5% |
| G | 44616 | 3.3% |
| M | 43317 | 3.2% |
| B | 38463 | 2.8% |
| Other values (16) | 268739 |
Lowercase Letter
| Value | Count | Frequency (%) |
| p | 1 | |
| a | 1 | |
| l | 1 | |
| m | 1 | |
| e | 1 | |
| r | 1 | |
| i | 1 |
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 383 | |
| 1 | 299 | |
| 6 | 75 | 9.8% |
| 7 | 9 | 1.2% |
| 4 | 1 | 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1350257 | |
| Common | 767 | 0.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| A | 260823 | |
| U | 244099 | |
| S | 209561 | |
| E | 69205 | 5.1% |
| N | 67199 | 5.0% |
| R | 56601 | 4.2% |
| C | 47627 | 3.5% |
| G | 44616 | 3.3% |
| M | 43317 | 3.2% |
| B | 38463 | 2.8% |
| Other values (23) | 268746 |
Common
| Value | Count | Frequency (%) |
| 0 | 383 | |
| 1 | 299 | |
| 6 | 75 | 9.8% |
| 7 | 9 | 1.2% |
| 4 | 1 | 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1351024 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| A | 260823 | |
| U | 244099 | |
| S | 209561 | |
| E | 69205 | 5.1% |
| N | 67199 | 5.0% |
| R | 56601 | 4.2% |
| C | 47627 | 3.5% |
| G | 44616 | 3.3% |
| M | 43317 | 3.2% |
| B | 38463 | 2.8% |
| Other values (28) | 269513 |
level0Name
Text
Missing 
| Distinct | 238 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 1911134 |
| Missing (%) | 80.9% |
| Memory size | 18.0 MiB |
Length
| Max length | 32 |
|---|---|
| Median length | 30 |
| Mean length | 10.17422653 |
| Min length | 4 |
Unique
| Unique | 5 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | United States |
|---|---|
| 2nd row | United States |
| 3rd row | United States |
| 4th row | United States |
| 5th row | Costa Rica |
| Value | Count | Frequency (%) |
| united | 193094 | |
| states | 192247 | |
| venezuela | 20119 | 2.9% |
| brazil | 20003 | 2.9% |
| guyana | 18195 | 2.6% |
| méxico | 16475 | 2.4% |
| ecuador | 12944 | 1.9% |
| peru | 9844 | 1.4% |
| canada | 9240 | 1.3% |
| french | 6658 | 1.0% |
| Other values (277) | 197267 |
Most occurring characters
| Value | Count | Frequency (%) |
| t | 615456 | |
| a | 532048 | |
| e | 528246 | |
| i | 371986 | 8.1% |
| n | 350657 | 7.7% |
| (space) | 245747 | 5.4% |
| d | 242883 | 5.3% |
| s | 236258 | 5.2% |
| S | 209349 | 4.6% |
| U | 194325 | 4.2% |
| Other values (55) | 1054896 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 3641450 | |
| Uppercase Letter | 692317 | 15.1% |
| Space Separator | 245747 | 5.4% |
| Other Punctuation | 2266 | < 0.1% |
| Open Punctuation | 23 | < 0.1% |
| Close Punctuation | 23 | < 0.1% |
| Dash Punctuation | 21 | < 0.1% |
| Decimal Number | 4 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| t | 615456 | |
| a | 532048 | |
| e | 528246 | |
| i | 371986 | |
| n | 350657 | |
| d | 242883 | 6.7% |
| s | 236258 | 6.5% |
| u | 115753 | 3.2% |
| o | 109573 | 3.0% |
| r | 103656 | 2.8% |
| Other values (21) | 434934 |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 209349 | |
| U | 194325 | |
| G | 34228 | 4.9% |
| C | 34212 | 4.9% |
| B | 32848 | 4.7% |
| P | 30756 | 4.4% |
| M | 29201 | 4.2% |
| V | 21508 | 3.1% |
| E | 16779 | 2.4% |
| A | 15001 | 2.2% |
| Other values (15) | 74110 | 10.7% |
Other Punctuation
| Value | Count | Frequency (%) |
| ' | 865 | |
| . | 756 | |
| , | 645 |
Decimal Number
| Value | Count | Frequency (%) |
| 6 | 2 | |
| 8 | 2 |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 245747 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 23 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 23 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 21 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 4333767 | |
| Common | 248084 | 5.4% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| t | 615456 | |
| a | 532048 | |
| e | 528246 | |
| i | 371986 | 8.6% |
| n | 350657 | 8.1% |
| d | 242883 | 5.6% |
| s | 236258 | 5.5% |
| S | 209349 | 4.8% |
| U | 194325 | 4.5% |
| u | 115753 | 2.7% |
| Other values (46) | 936806 |
Common
| Value | Count | Frequency (%) |
| (space) | 245747 | |
| ' | 865 | 0.3% |
| . | 756 | 0.3% |
| , | 645 | 0.3% |
| ( | 23 | < 0.1% |
| ) | 23 | < 0.1% |
| - | 21 | < 0.1% |
| 6 | 2 | < 0.1% |
| 8 | 2 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 4563119 | |
| None | 18732 | 0.4% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| t | 615456 | |
| a | 532048 | |
| e | 528246 | |
| i | 371986 | 8.2% |
| n | 350657 | 7.7% |
| (space) | 245747 | 5.4% |
| d | 242883 | 5.3% |
| s | 236258 | 5.2% |
| S | 209349 | 4.6% |
| U | 194325 | 4.3% |
| Other values (50) | 1036164 |
None
| Value | Count | Frequency (%) |
| é | 16737 | |
| ô | 865 | 4.6% |
| ç | 628 | 3.4% |
| í | 251 | 1.3% |
| ã | 251 | 1.3% |
level1Gid
Text
Missing 
| Distinct | 2569 |
|---|---|
| Distinct (%) | 0.6% |
| Missing | 1912772 |
| Missing (%) | 81.0% |
| Memory size | 18.0 MiB |
Length
| Max length | 8 |
|---|---|
| Median length | 8 |
| Mean length | 7.592494779 |
| Min length | 6 |
Unique
| Unique | 311 |
|---|---|
| Unique (%) | 0.1% |
Sample
| 1st row | USA.49_1 |
|---|---|
| 2nd row | USA.20_1 |
| 3rd row | USA.32_1 |
| 4th row | USA.38_1 |
| 5th row | CRI.2_1 |
| Value | Count | Frequency (%) |
| usa.47_1 | 28132 | 6.3% |
| usa.21_1 | 17503 | 3.9% |
| usa.34_1 | 16052 | 3.6% |
| usa.5_1 | 12531 | 2.8% |
| usa.10_1 | 9880 | 2.2% |
| usa.49_1 | 6455 | 1.4% |
| ven.1_1 | 6336 | 1.4% |
| usa.39_1 | 6190 | 1.4% |
| usa.6_1 | 5820 | 1.3% |
| usa.9_1 | 5812 | 1.3% |
| Other values (2559) | 333990 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 602900 | |
| _ | 448654 | |
| . | 446732 | |
| A | 259830 | 7.6% |
| U | 243468 | 7.1% |
| S | 209473 | 6.1% |
| 2 | 119917 | 3.5% |
| 4 | 112033 | 3.3% |
| 3 | 88307 | 2.6% |
| E | 69205 | 2.0% |
| Other values (28) | 806241 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 1345474 | |
| Decimal Number | 1165900 | |
| Connector Punctuation | 448654 | 13.2% |
| Other Punctuation | 446732 | 13.1% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 259830 | |
| U | 243468 | |
| S | 209473 | |
| E | 69205 | 5.1% |
| N | 67135 | 5.0% |
| R | 56358 | 4.2% |
| C | 46945 | 3.5% |
| G | 44614 | 3.3% |
| M | 43301 | 3.2% |
| B | 38446 | 2.9% |
| Other values (16) | 266699 |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 602900 | |
| 2 | 119917 | 10.3% |
| 4 | 112033 | 9.6% |
| 3 | 88307 | 7.6% |
| 5 | 48668 | 4.2% |
| 7 | 48244 | 4.1% |
| 9 | 42418 | 3.6% |
| 6 | 37250 | 3.2% |
| 8 | 33151 | 2.8% |
| 0 | 33012 | 2.8% |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 448654 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 446732 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 2061286 | |
| Latin | 1345474 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| A | 259830 | |
| U | 243468 | |
| S | 209473 | |
| E | 69205 | 5.1% |
| N | 67135 | 5.0% |
| R | 56358 | 4.2% |
| C | 46945 | 3.5% |
| G | 44614 | 3.3% |
| M | 43301 | 3.2% |
| B | 38446 | 2.9% |
| Other values (16) | 266699 |
Common
| Value | Count | Frequency (%) |
| 1 | 602900 | |
| _ | 448654 | |
| . | 446732 | |
| 2 | 119917 | 5.8% |
| 4 | 112033 | 5.4% |
| 3 | 88307 | 4.3% |
| 5 | 48668 | 2.4% |
| 7 | 48244 | 2.3% |
| 9 | 42418 | 2.1% |
| 6 | 37250 | 1.8% |
| Other values (2) | 66163 | 3.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3406760 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 602900 | |
| _ | 448654 | |
| . | 446732 | |
| A | 259830 | 7.6% |
| U | 243468 | 7.1% |
| S | 209473 | 6.1% |
| 2 | 119917 | 3.5% |
| 4 | 112033 | 3.3% |
| 3 | 88307 | 2.6% |
| E | 69205 | 2.0% |
| Other values (28) | 806241 |
level1Name
Text
Missing 
| Distinct | 2469 |
|---|---|
| Distinct (%) | 0.6% |
| Missing | 1912766 |
| Missing (%) | 81.0% |
| Memory size | 18.0 MiB |
Length
| Max length | 32 |
|---|---|
| Median length | 29 |
| Mean length | 9.529755497 |
| Min length | 3 |
Unique
| Unique | 305 |
|---|---|
| Unique (%) | 0.1% |
Sample
| 1st row | West Virginia |
|---|---|
| 2nd row | Maine |
| 3rd row | New Mexico |
| 4th row | Oregon |
| 5th row | Cartago |
| Value | Count | Frequency (%) |
| virginia | 34587 | 5.8% |
| carolina | 18805 | 3.2% |
| maryland | 17505 | 2.9% |
| north | 16899 | 2.8% |
| california | 14263 | 2.4% |
| amazonas | 11119 | 1.9% |
| florida | 9889 | 1.7% |
| new | 9352 | 1.6% |
| columbia | 7231 | 1.2% |
| west | 7206 | 1.2% |
| Other values (2653) | 446841 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 626722 | |
| i | 390317 | 9.1% |
| n | 330354 | 7.7% |
| r | 306970 | 7.2% |
| o | 292322 | 6.8% |
| e | 218622 | 5.1% |
| s | 178051 | 4.2% |
| l | 167995 | 3.9% |
| t | 152746 | 3.6% |
| (space) | 144990 | 3.4% |
| Other values (126) | 1466979 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 3489310 | |
| Uppercase Letter | 604684 | 14.1% |
| Space Separator | 144990 | 3.4% |
| Dash Punctuation | 34622 | 0.8% |
| Other Punctuation | 2409 | 0.1% |
| Modifier Symbol | 47 | < 0.1% |
| Open Punctuation | 3 | < 0.1% |
| Close Punctuation | 3 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 626722 | |
| i | 390317 | |
| n | 330354 | |
| r | 306970 | |
| o | 292322 | |
| e | 218622 | 6.3% |
| s | 178051 | 5.1% |
| l | 167995 | 4.8% |
| t | 152746 | 4.4% |
| u | 136092 | 3.9% |
| Other values (76) | 689119 |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 85346 | |
| M | 61462 | 10.2% |
| S | 45740 | 7.6% |
| N | 43998 | 7.3% |
| A | 41807 | 6.9% |
| V | 40147 | 6.6% |
| P | 34481 | 5.7% |
| T | 33279 | 5.5% |
| B | 24460 | 4.0% |
| O | 20393 | 3.4% |
| Other values (30) | 173571 |
Other Punctuation
| Value | Count | Frequency (%) |
| ' | 1289 | |
| . | 405 | 16.8% |
| ! | 336 | 13.9% |
| / | 270 | 11.2% |
| , | 109 | 4.5% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 144990 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 34622 |
Modifier Symbol
| Value | Count | Frequency (%) |
| ` | 47 |
Open Punctuation
| Value | Count | Frequency (%) |
| [ | 3 |
Close Punctuation
| Value | Count | Frequency (%) |
| ] | 3 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 4093994 | |
| Common | 182074 | 4.3% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 626722 | |
| i | 390317 | 9.5% |
| n | 330354 | 8.1% |
| r | 306970 | 7.5% |
| o | 292322 | 7.1% |
| e | 218622 | 5.3% |
| s | 178051 | 4.3% |
| l | 167995 | 4.1% |
| t | 152746 | 3.7% |
| u | 136092 | 3.3% |
| Other values (116) | 1293803 |
Common
| Value | Count | Frequency (%) |
| (space) | 144990 | |
| - | 34622 | 19.0% |
| ' | 1289 | 0.7% |
| . | 405 | 0.2% |
| ! | 336 | 0.2% |
| / | 270 | 0.1% |
| , | 109 | 0.1% |
| ` | 47 | < 0.1% |
| [ | 3 | < 0.1% |
| ] | 3 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 4231633 | |
| None | 44134 | 1.0% |
| Latin Ext Additional | 301 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 626722 | |
| i | 390317 | 9.2% |
| n | 330354 | 7.8% |
| r | 306970 | 7.3% |
| o | 292322 | 6.9% |
| e | 218622 | 5.2% |
| s | 178051 | 4.2% |
| l | 167995 | 4.0% |
| t | 152746 | 3.6% |
| (space) | 144990 | 3.4% |
| Other values (52) | 1422544 |
None
| Value | Count | Frequency (%) |
| í | 11066 | |
| á | 11012 | |
| é | 7746 | |
| ó | 5432 | |
| ã | 2198 | 5.0% |
| Î | 1381 | 3.1% |
| ô | 965 | 2.2% |
| ü | 753 | 1.7% |
| ñ | 686 | 1.6% |
| â | 598 | 1.4% |
| Other values (49) | 2297 | 5.2% |
Latin Ext Additional
| Value | Count | Frequency (%) |
| ồ | 76 | |
| ậ | 41 | |
| ẵ | 37 | |
| ạ | 27 | 9.0% |
| ắ | 27 | 9.0% |
| ả | 23 | 7.6% |
| ộ | 20 | 6.6% |
| ệ | 17 | 5.6% |
| ằ | 11 | 3.7% |
| ị | 7 | 2.3% |
| Other values (5) | 15 | 5.0% |
level2Gid
Text
Missing 
| Distinct | 14207 |
|---|---|
| Distinct (%) | 3.3% |
| Missing | 1927752 |
| Missing (%) | 81.6% |
| Memory size | 18.0 MiB |
Length
| Max length | 12 |
|---|---|
| Median length | 11 |
| Mean length | 10.17988753 |
| Min length | 7 |
Unique
| Unique | 3239 |
|---|---|
| Unique (%) | 0.7% |
Sample
| 1st row | USA.49.42_1 |
|---|---|
| 2nd row | USA.20.10_1 |
| 3rd row | USA.32.8_1 |
| 4th row | USA.38.35_1 |
| 5th row | CRI.2.2_1 |
| Value | Count | Frequency (%) |
| usa.9.1_1 | 5812 | 1.3% |
| usa.21.15_1 | 4120 | 0.9% |
| usa.21.16_1 | 4057 | 0.9% |
| guy.8.8_1 | 3799 | 0.9% |
| usa.34.87_1 | 2754 | 0.6% |
| guy.2.8_1 | 2722 | 0.6% |
| guy.10.4_1 | 2607 | 0.6% |
| usa.47.40_1 | 2604 | 0.6% |
| usa.10.43_1 | 2474 | 0.6% |
| usa.47.50_1 | 2230 | 0.5% |
| Other values (14197) | 400542 |
Most occurring characters
| Value | Count | Frequency (%) |
| . | 865426 | |
| 1 | 701622 | |
| _ | 433720 | |
| A | 257502 | 5.8% |
| 2 | 256721 | 5.8% |
| U | 241816 | 5.5% |
| S | 207671 | 4.7% |
| 4 | 180297 | 4.1% |
| 3 | 160266 | 3.6% |
| 5 | 113447 | 2.6% |
| Other values (28) | 996743 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 1815689 | |
| Uppercase Letter | 1300396 | |
| Other Punctuation | 865426 | |
| Connector Punctuation | 433720 | 9.8% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 257502 | |
| U | 241816 | |
| S | 207671 | |
| E | 69062 | 5.3% |
| N | 66514 | 5.1% |
| R | 52484 | 4.0% |
| C | 45749 | 3.5% |
| G | 42495 | 3.3% |
| M | 38963 | 3.0% |
| B | 35575 | 2.7% |
| Other values (16) | 242565 |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 701622 | |
| 2 | 256721 | 14.1% |
| 4 | 180297 | 9.9% |
| 3 | 160266 | 8.8% |
| 5 | 113447 | 6.2% |
| 7 | 92449 | 5.1% |
| 6 | 87379 | 4.8% |
| 8 | 82143 | 4.5% |
| 9 | 73044 | 4.0% |
| 0 | 68321 | 3.8% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 865426 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 433720 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 3114835 | |
| Latin | 1300396 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| A | 257502 | |
| U | 241816 | |
| S | 207671 | |
| E | 69062 | 5.3% |
| N | 66514 | 5.1% |
| R | 52484 | 4.0% |
| C | 45749 | 3.5% |
| G | 42495 | 3.3% |
| M | 38963 | 3.0% |
| B | 35575 | 2.7% |
| Other values (16) | 242565 |
Common
| Value | Count | Frequency (%) |
| . | 865426 | |
| 1 | 701622 | |
| _ | 433720 | |
| 2 | 256721 | 8.2% |
| 4 | 180297 | 5.8% |
| 3 | 160266 | 5.1% |
| 5 | 113447 | 3.6% |
| 7 | 92449 | 3.0% |
| 6 | 87379 | 2.8% |
| 8 | 82143 | 2.6% |
| Other values (2) | 141365 | 4.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 4415231 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| . | 865426 | |
| 1 | 701622 | |
| _ | 433720 | |
| A | 257502 | 5.8% |
| 2 | 256721 | 5.8% |
| U | 241816 | 5.5% |
| S | 207671 | 4.7% |
| 4 | 180297 | 4.1% |
| 3 | 160266 | 3.6% |
| 5 | 113447 | 2.6% |
| Other values (28) | 996743 |
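The sampled values in the level0Gid, level1Gid, and level2Gid columns appear to share one pattern: a three-character country code, one numeric index per administrative level separated by dots, and a version suffix after an underscore (for example `USA.49.42_1`). The sketch below splits such a value under that assumption. The helper `parse_gid`, the `Gid` tuple, and the regular expression are hypothetical and inferred only from the sample rows shown here; values that do not fit the pattern return `None`.

```python
import re
from typing import NamedTuple, Optional, Tuple

class Gid(NamedTuple):
    country: str                # e.g. "USA"
    indices: Tuple[int, ...]    # one index per administrative level
    version: Optional[int]      # suffix after "_", absent at level 0

# Assumed pattern: 3 uppercase letters/digits, optional ".N" groups,
# optional "_N" version suffix.
_GID_RE = re.compile(r"^([A-Z0-9]{3})((?:\.[0-9]+)*)(?:_([0-9]+))?$")

def parse_gid(value: str) -> Optional[Gid]:
    """Split an identifier into its parts, or return None if it does not match."""
    match = _GID_RE.match(value)
    if match is None:
        return None
    country, dotted, version = match.groups()
    indices = tuple(int(part) for part in dotted.split(".") if part)
    return Gid(country, indices, int(version) if version else None)

print(parse_gid("USA.49.42_1"))
print(parse_gid("CRI"))
print(parse_gid("not a gid"))
```

Counting the `None` results per column is one way to estimate how many entries deviate from the assumed pattern before relying on these fields for joins.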
level2Name
Text
Missing 
| Distinct | 12304 |
|---|---|
| Distinct (%) | 2.8% |
| Missing | 1927850 |
| Missing (%) | 81.6% |
| Memory size | 18.0 MiB |
Length
| Max length | 32 |
|---|---|
| Median length | 28 |
| Mean length | 9.255454623 |
| Min length | 1 |
Unique
| Unique | 2893 |
|---|---|
| Unique (%) | 0.7% |
Sample
| 1st row | Randolph |
|---|---|
| 2nd row | Penobscot |
| 3rd row | Dona Ana |
| 4th row | Wheeler |
| 5th row | Cartago |
| Value | Count | Frequency (%) |
| of | 16654 | 2.7% |
| rest | 9994 | 1.6% |
| region | 9986 | 1.6% |
| san | 8897 | 1.4% |
| de | 7619 | 1.2% |
| columbia | 6006 | 1.0% |
| district | 5938 | 1.0% |
| prince | 5814 | 0.9% |
| montgomery | 5411 | 0.9% |
| 4736 | 0.8% | |
| Other values (12309) | 538341 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 472400 | 11.8% |
| o | 308144 | 7.7% |
| e | 303215 | 7.6% |
| n | 287418 | 7.2% |
| i | 252180 | 6.3% |
| r | 237397 | 5.9% |
| (space) | 185773 | 4.6% |
| t | 161195 | 4.0% |
| l | 160373 | 4.0% |
| s | 140991 | 3.5% |
| Other values (161) | 1504292 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 3194580 | |
| Uppercase Letter | 582861 | 14.5% |
| Space Separator | 185773 | 4.6% |
| Dash Punctuation | 15579 | 0.4% |
| Decimal Number | 14276 | 0.4% |
| Other Punctuation | 14257 | 0.4% |
| Open Punctuation | 3087 | 0.1% |
| Close Punctuation | 1816 | < 0.1% |
| Math Symbol | 1131 | < 0.1% |
| Modifier Symbol | 18 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 472400 | |
| o | 308144 | |
| e | 303215 | |
| n | 287418 | 9.0% |
| i | 252180 | 7.9% |
| r | 237397 | 7.4% |
| t | 161195 | 5.0% |
| l | 160373 | 5.0% |
| s | 140991 | 4.4% |
| u | 135057 | 4.2% |
| Other values (91) | 736210 |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 66713 | 11.4% |
| S | 55503 | 9.5% |
| M | 50752 | 8.7% |
| R | 40959 | 7.0% |
| P | 36962 | 6.3% |
| B | 35720 | 6.1% |
| A | 34671 | 5.9% |
| L | 26794 | 4.6% |
| G | 25252 | 4.3% |
| D | 24677 | 4.2% |
| Other values (37) | 184858 |
Decimal Number
| Value | Count | Frequency (%) |
| 8 | 3930 | |
| 7 | 2826 | |
| 9 | 2732 | |
| 1 | 2623 | |
| 0 | 980 | 6.9% |
| 2 | 317 | 2.2% |
| 3 | 305 | 2.1% |
| 6 | 265 | 1.9% |
| 5 | 228 | 1.6% |
| 4 | 70 | 0.5% |
Other Punctuation
| Value | Count | Frequency (%) |
| ' | 6045 | |
| . | 4259 | |
| / | 2092 | 14.7% |
| , | 1702 | 11.9% |
| & | 91 | 0.6% |
| ? | 62 | 0.4% |
| # | 6 | < 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 185773 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 15579 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 3087 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 1816 |
Math Symbol
| Value | Count | Frequency (%) |
| + | 1131 |
Modifier Symbol
| Value | Count | Frequency (%) |
| ` | 18 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 3777441 | |
| Common | 235937 | 5.9% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 472400 | 12.5% |
| o | 308144 | 8.2% |
| e | 303215 | 8.0% |
| n | 287418 | 7.6% |
| i | 252180 | 6.7% |
| r | 237397 | 6.3% |
| t | 161195 | 4.3% |
| l | 160373 | 4.2% |
| s | 140991 | 3.7% |
| u | 135057 | 3.6% |
| Other values (138) | 1319071 |
Common
| Value | Count | Frequency (%) |
| (space) | 185773 | |
| - | 15579 | 6.6% |
| ' | 6045 | 2.6% |
| . | 4259 | 1.8% |
| 8 | 3930 | 1.7% |
| ( | 3087 | 1.3% |
| 7 | 2826 | 1.2% |
| 9 | 2732 | 1.2% |
| 1 | 2623 | 1.1% |
| / | 2092 | 0.9% |
| Other values (13) | 6991 | 3.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3963042 | |
| None | 50010 | 1.2% |
| Latin Ext Additional | 326 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 472400 | 11.9% |
| o | 308144 | 7.8% |
| e | 303215 | 7.7% |
| n | 287418 | 7.3% |
| i | 252180 | 6.4% |
| r | 237397 | 6.0% |
| (space) | 185773 | 4.7% |
| t | 161195 | 4.1% |
| l | 160373 | 4.0% |
| s | 140991 | 3.6% |
| Other values (65) | 1453956 |
None
| Value | Count | Frequency (%) |
| ó | 9474 | |
| í | 9371 | |
| á | 9248 | |
| é | 9176 | |
| ã | 2578 | 5.2% |
| ñ | 2379 | 4.8% |
| ú | 1522 | 3.0% |
| ê | 1021 | 2.0% |
| ü | 956 | 1.9% |
| ç | 746 | 1.5% |
| Other values (63) | 3539 | 7.1% |
Latin Ext Additional
| Value | Count | Frequency (%) |
| ạ | 67 | |
| ả | 67 | |
| ế | 33 | |
| ờ | 29 | |
| ậ | 15 | 4.6% |
| ữ | 15 | 4.6% |
| ị | 15 | 4.6% |
| ủ | 14 | 4.3% |
| ồ | 13 | 4.0% |
| ợ | 12 | 3.7% |
| Other values (13) | 46 |
level3Gid
Text
Missing 
| Distinct | 8201 |
|---|---|
| Distinct (%) | 8.0% |
| Missing | 2259567 |
| Missing (%) | 95.7% |
| Memory size | 18.0 MiB |
Length
| Max length | 36 |
|---|---|
| Median length | 22 |
| Mean length | 11.78068023 |
| Min length | 11 |
Unique
| Unique | 2819 |
|---|---|
| Unique (%) | 2.8% |
Sample
| 1st row | CRI.2.2.4_1 |
|---|---|
| 2nd row | IND.19.16.3_1 |
| 3rd row | CHN.30.7.7_1 |
| 4th row | CRI.7.10.3_1 |
| 5th row | RUS.61.13.1_1 |
| Value | Count | Frequency (%) |
| can.13.1.35_1 | 1996 | 2.0% |
| per.18.3.4_1 | 1086 | 1.1% |
| per.8.9.1_1 | 918 | 0.9% |
| per.1.4.3_1 | 869 | 0.9% |
| pan.4.2.4_1 | 817 | 0.8% |
| pan.4.2.6_1 | 809 | 0.8% |
| mdg.2.1.5_1 | 704 | 0.7% |
| cri.5.2.1_1 | 568 | 0.6% |
| mdg.6.2.3_1 | 521 | 0.5% |
| per.18.1.1_1 | 500 | 0.5% |
| Other values (8193) | 93120 |
Most occurring characters
| Value | Count | Frequency (%) |
| . | 305698 | |
| 1 | 204681 | |
| _ | 101899 | 8.5% |
| 2 | 69213 | 5.8% |
| 3 | 45164 | 3.8% |
| 4 | 42456 | 3.5% |
| C | 35435 | 3.0% |
| E | 30901 | 2.6% |
| 5 | 30366 | 2.5% |
| A | 29431 | 2.5% |
| Other values (38) | 305278 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 487863 | |
| Other Punctuation | 305698 | |
| Uppercase Letter | 304935 | |
| Connector Punctuation | 101899 | 8.5% |
| Lowercase Letter | 101 | < 0.1% |
| Dash Punctuation | 24 | < 0.1% |
| Space Separator | 2 | < 0.1% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 35435 | |
| E | 30901 | 10.1% |
| A | 29431 | 9.7% |
| N | 29194 | 9.6% |
| R | 22473 | 7.4% |
| P | 21127 | 6.9% |
| H | 15472 | 5.1% |
| U | 15163 | 5.0% |
| L | 14604 | 4.8% |
| I | 13610 | 4.5% |
| Other values (13) | 77525 |
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 28 | |
| c | 25 | |
| b | 18 | |
| d | 12 | |
| e | 9 | 8.9% |
| r | 2 | 2.0% |
| i | 2 | 2.0% |
| l | 2 | 2.0% |
| s | 1 | 1.0% |
| m | 1 | 1.0% |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 204681 | |
| 2 | 69213 | 14.2% |
| 3 | 45164 | 9.3% |
| 4 | 42456 | 8.7% |
| 5 | 30366 | 6.2% |
| 6 | 25988 | 5.3% |
| 8 | 22572 | 4.6% |
| 9 | 17190 | 3.5% |
| 7 | 16841 | 3.5% |
| 0 | 13392 | 2.7% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 305698 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 101899 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 24 |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 2 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 895486 | |
| Latin | 305036 | 25.4% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| C | 35435 | |
| E | 30901 | 10.1% |
| A | 29431 | 9.6% |
| N | 29194 | 9.6% |
| R | 22473 | 7.4% |
| P | 21127 | 6.9% |
| H | 15472 | 5.1% |
| U | 15163 | 5.0% |
| L | 14604 | 4.8% |
| I | 13610 | 4.5% |
| Other values (24) | 77626 |
Common
| Value | Count | Frequency (%) |
| . | 305698 | |
| 1 | 204681 | |
| _ | 101899 | 11.4% |
| 2 | 69213 | 7.7% |
| 3 | 45164 | 5.0% |
| 4 | 42456 | 4.7% |
| 5 | 30366 | 3.4% |
| 6 | 25988 | 2.9% |
| 8 | 22572 | 2.5% |
| 9 | 17190 | 1.9% |
| Other values (4) | 30259 | 3.4% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1200522 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| . | 305698 | |
| 1 | 204681 | |
| _ | 101899 | 8.5% |
| 2 | 69213 | 5.8% |
| 3 | 45164 | 3.8% |
| 4 | 42456 | 3.5% |
| C | 35435 | 3.0% |
| E | 30901 | 2.6% |
| 5 | 30366 | 2.5% |
| A | 29431 | 2.5% |
| Other values (38) | 305278 |
level3Name
Text
Missing 
| Distinct | 7685 |
|---|---|
| Distinct (%) | 7.6% |
| Missing | 2260777 |
| Missing (%) | 95.7% |
| Memory size | 18.0 MiB |
Length
| Max length | 32 |
|---|---|
| Median length | 28 |
| Mean length | 10.13161397 |
| Min length | 2 |
Unique
| Unique | 2589 |
|---|---|
| Unique (%) | 2.6% |
Sample
| 1st row | Dulce Nombre |
|---|---|
| 2nd row | Kukshi |
| 3rd row | Lunan |
| 4th row | San Pedro |
| 5th row | Ban Luang |
| Value | Count | Frequency (%) |
| unorganized | 3367 | 2.2% |
| san | 3200 | 2.1% |
| de | 3074 | 2.0% |
| yukon | 1996 | 1.3% |
| el | 1944 | 1.2% |
| santa | 1489 | 1.0% |
| la | 1389 | 0.9% |
| rio | 1264 | 0.8% |
| no | 1168 | 0.8% |
| tambopata | 1086 | 0.7% |
| Other values (7998) | 135629 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 141798 | 13.9% |
| o | 73803 | 7.2% |
| n | 72305 | 7.1% |
| i | 64581 | 6.3% |
| e | 60134 | 5.9% |
| (space) | 54910 | 5.4% |
| r | 52655 | 5.2% |
| u | 39382 | 3.9% |
| l | 35764 | 3.5% |
| t | 33716 | 3.3% |
| Other values (128) | 391165 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 787258 | |
| Uppercase Letter | 151480 | 14.8% |
| Space Separator | 54910 | 5.4% |
| Other Punctuation | 10121 | 1.0% |
| Decimal Number | 6298 | 0.6% |
| Open Punctuation | 4014 | 0.4% |
| Close Punctuation | 3325 | 0.3% |
| Dash Punctuation | 2796 | 0.3% |
| Final Punctuation | 11 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 141798 | |
| o | 73803 | 9.4% |
| n | 72305 | 9.2% |
| i | 64581 | 8.2% |
| e | 60134 | 7.6% |
| r | 52655 | 6.7% |
| u | 39382 | 5.0% |
| l | 35764 | 4.5% |
| t | 33716 | 4.3% |
| s | 25651 | 3.3% |
| Other values (72) | 187469 |
Uppercase Letter
| Value | Count | Frequency (%) |
|---|---|---|
| S | 15511 | 10.2% |
| C | 14824 | 9.8% |
| B | 10184 | 6.7% |
| T | 9920 | 6.5% |
| P | 9627 | 6.4% |
| M | 9584 | 6.3% |
| A | 8905 | 5.9% |
| L | 7661 | 5.1% |
| N | 6992 | 4.6% |
| K | 6215 | 4.1% |
| Other values (24) | 52057 | 34.4% |
Decimal Number
| Value | Count | Frequency (%) |
|---|---|---|
| 1 | 1989 | 31.6% |
| 2 | 877 | 13.9% |
| 3 | 556 | 8.8% |
| 4 | 506 | 8.0% |
| 9 | 485 | 7.7% |
| 5 | 463 | 7.4% |
| 0 | 435 | 6.9% |
| 6 | 404 | 6.4% |
| 8 | 295 | 4.7% |
| 7 | 288 | 4.6% |
Other Punctuation
| Value | Count | Frequency (%) |
|---|---|---|
| . | 4784 | 47.3% |
| , | 4425 | 43.7% |
| / | 358 | 3.5% |
| ' | 346 | 3.4% |
| ! | 191 | 1.9% |
| : | 11 | 0.1% |
| " | 6 | 0.1% |
Space Separator
| Value | Count | Frequency (%) |
|---|---|---|
| (space) | 54910 | 100.0% |
Open Punctuation
| Value | Count | Frequency (%) |
|---|---|---|
| ( | 4014 | 100.0% |
Close Punctuation
| Value | Count | Frequency (%) |
|---|---|---|
| ) | 3325 | 100.0% |
Dash Punctuation
| Value | Count | Frequency (%) |
|---|---|---|
| - | 2796 | 100.0% |
Final Punctuation
| Value | Count | Frequency (%) |
|---|---|---|
| ’ | 11 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
|---|---|---|
| Latin | 938738 | 92.0% |
| Common | 81475 | 8.0% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
|---|---|---|
| a | 141798 | 15.1% |
| o | 73803 | 7.9% |
| n | 72305 | 7.7% |
| i | 64581 | 6.9% |
| e | 60134 | 6.4% |
| r | 52655 | 5.6% |
| u | 39382 | 4.2% |
| l | 35764 | 3.8% |
| t | 33716 | 3.6% |
| s | 25651 | 2.7% |
| Other values (106) | 338949 | 36.1% |
Common
| Value | Count | Frequency (%) |
|---|---|---|
| (space) | 54910 | 67.4% |
| . | 4784 | 5.9% |
| , | 4425 | 5.4% |
| ( | 4014 | 4.9% |
| ) | 3325 | 4.1% |
| - | 2796 | 3.4% |
| 1 | 1989 | 2.4% |
| 2 | 877 | 1.1% |
| 3 | 556 | 0.7% |
| 4 | 506 | 0.6% |
| Other values (12) | 3293 | 4.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
|---|---|---|
| ASCII | 1010607 | 99.1% |
| None | 9254 | 0.9% |
| Latin Ext Additional | 341 | < 0.1% |
| Punctuation | 11 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
|---|---|---|
| a | 141798 | 14.0% |
| o | 73803 | 7.3% |
| n | 72305 | 7.2% |
| i | 64581 | 6.4% |
| e | 60134 | 6.0% |
| (space) | 54910 | 5.4% |
| r | 52655 | 5.2% |
| u | 39382 | 3.9% |
| l | 35764 | 3.5% |
| t | 33716 | 3.3% |
| Other values (63) | 381559 | 37.8% |
None
| Value | Count | Frequency (%) |
|---|---|---|
| á | 1795 | 19.4% |
| é | 1690 | 18.3% |
| ó | 1511 | 16.3% |
| ñ | 1432 | 15.5% |
| í | 927 | 10.0% |
| ê | 371 | 4.0% |
| è | 288 | 3.1% |
| ü | 183 | 2.0% |
| à | 156 | 1.7% |
| â | 120 | 1.3% |
| Other values (31) | 781 | 8.4% |
Latin Ext Additional
| Value | Count | Frequency (%) |
|---|---|---|
| ả | 58 | 17.0% |
| ế | 40 | 11.7% |
| ờ | 30 | 8.8% |
| ọ | 24 | 7.0% |
| ậ | 22 | 6.5% |
| ỷ | 21 | 6.2% |
| ồ | 21 | 6.2% |
| ớ | 17 | 5.0% |
| ạ | 16 | 4.7% |
| ố | 16 | 4.7% |
| Other values (13) | 76 | 22.3% |
Punctuation
| Value | Count | Frequency (%) |
|---|---|---|
| ’ | 11 | 100.0% |
Missing 
| Distinct | 15 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 383090 |
| Missing (%) | 16.2% |
| Memory size | 18.0 MiB |
Length
| Max length | 24 |
|---|---|
| Median length | 2 |
| Mean length | 2.000066721 |
| Min length | 2 |
Unique
| Unique | 6 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | NE |
|---|---|
| 2nd row | NE |
| 3rd row | LC |
| 4th row | NE |
| 5th row | NE |
| Value | Count | Frequency (%) |
|---|---|---|
| ne | 1310250 | 66.2% |
| lc | 593364 | 30.0% |
| vu | 24743 | 1.3% |
| nt | 19503 | 1.0% |
| en | 12442 | 0.6% |
| dd | 10871 | 0.5% |
| cr | 6368 | 0.3% |
| ex | 663 | < 0.1% |
| ew | 173 | < 0.1% |
| 2024-12-02t13:57:00.684z | 1 | < 0.1% |
| Other values (5) | 5 | < 0.1% |
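The length summary (median 2, maximum 24) and the single timestamp-like entry in the table above suggest a handful of stray values in a column that otherwise holds two-letter codes. A minimal sketch for surfacing such rows, assuming the column is available as a plain Python list with `None` for missing cells; `length_outliers` is a hypothetical helper, not part of any profiling library:

```python
from collections import Counter


def length_outliers(values):
    """Return the values whose length differs from the most common length."""
    present = [v for v in values if v is not None]
    if not present:
        return []
    # The modal length stands in for the expected width of the column.
    expected_len, _ = Counter(len(v) for v in present).most_common(1)[0]
    return [v for v in present if len(v) != expected_len]


# Illustrative values only: codes from the Sample table plus the
# timestamp-like entry listed in the frequency table.
codes = ["NE", "NE", "LC", None, "VU", "2024-12-02t13:57:00.684z"]
print(length_outliers(codes))
```

A length check is deliberately crude: it flags candidates for inspection and says nothing about why a value ended up in the column.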
Most occurring characters
| Value | Count | Frequency (%) |
|---|---|---|
| N | 1342195 | 33.9% |
| E | 1323528 | 33.4% |
| C | 599732 | 15.2% |
| L | 593364 | 15.0% |
| V | 24743 | 0.6% |
| U | 24743 | 0.6% |
| D | 21742 | 0.5% |
| T | 19509 | 0.5% |
| R | 6368 | 0.2% |
| X | 663 | < 0.1% |
| Other values (15) | 311 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
|---|---|---|
| Uppercase Letter | 3956766 | > 99.9% |
| Decimal Number | 102 | < 0.1% |
| Other Punctuation | 18 | < 0.1% |
| Dash Punctuation | 12 | < 0.1% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
|---|---|---|
| N | 1342195 | 33.9% |
| E | 1323528 | 33.4% |
| C | 599732 | 15.2% |
| L | 593364 | 15.0% |
| V | 24743 | 0.6% |
| U | 24743 | 0.6% |
| D | 21742 | 0.5% |
| T | 19509 | 0.5% |
| R | 6368 | 0.2% |
| X | 663 | < 0.1% |
| Other values (2) | 179 | < 0.1% |
Decimal Number
| Value | Count | Frequency (%) |
|---|---|---|
| 2 | 28 | 27.5% |
| 0 | 16 | 15.7% |
| 1 | 14 | 13.7% |
| 5 | 11 | 10.8% |
| 3 | 10 | 9.8% |
| 4 | 8 | 7.8% |
| 8 | 5 | 4.9% |
| 6 | 4 | 3.9% |
| 7 | 3 | 2.9% |
| 9 | 3 | 2.9% |
Other Punctuation
| Value | Count | Frequency (%) |
|---|---|---|
| : | 12 | 66.7% |
| . | 6 | 33.3% |
Dash Punctuation
| Value | Count | Frequency (%) |
|---|---|---|
| - | 12 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
|---|---|---|
| Latin | 3956766 | > 99.9% |
| Common | 132 | < 0.1% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
|---|---|---|
| 2 | 28 | 21.2% |
| 0 | 16 | 12.1% |
| 1 | 14 | 10.6% |
| - | 12 | 9.1% |
| : | 12 | 9.1% |
| 5 | 11 | 8.3% |
| 3 | 10 | 7.6% |
| 4 | 8 | 6.1% |
| . | 6 | 4.5% |
| 8 | 5 | 3.8% |
| Other values (3) | 10 | 7.6% |
Latin
| Value | Count | Frequency (%) |
|---|---|---|
| N | 1342195 | 33.9% |
| E | 1323528 | 33.4% |
| C | 599732 | 15.2% |
| L | 593364 | 15.0% |
| V | 24743 | 0.6% |
| U | 24743 | 0.6% |
| D | 21742 | 0.5% |
| T | 19509 | 0.5% |
| R | 6368 | 0.2% |
| X | 663 | < 0.1% |
| Other values (2) | 179 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
|---|---|---|
| ASCII | 3956898 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
|---|---|---|
| N | 1342195 | 33.9% |
| E | 1323528 | 33.4% |
| C | 599732 | 15.2% |
| L | 593364 | 15.0% |
| V | 24743 | 0.6% |
| U | 24743 | 0.6% |
| D | 21742 | 0.5% |
| T | 19509 | 0.5% |
| R | 6368 | 0.2% |
| X | 663 | < 0.1% |
| Other values (15) | 311 | < 0.1% |